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ABSTRACT 

We discuss the measurements of the galaxy cluster mass functions at z r; 0.05 and z « 0.5 using high-quaUty 
Chandra observations of samples derived from the ROSAT PSPC All-Sky and 400 deg^ surveys. We provide a 
full reference for the data analysis procedures, present updated calibration of relations between the total cluster 
mass and its X-ray indicators {Tx, Mg^s, and Yx) based on a subsample of low-z relaxed clusters, and present a 
first measurement of the evolving Lx - M(ot relation (with M,ot estimated from Yx) obtained from a well-defined 
statistically complete cluster sample and with appropriate corrections for the Malmquist bias applied. Finally, 
we present the derived cluster mass functions, estimate the systematic uncertainties in this measurement, and 
discuss the calculation of the likelihood function. We confidently measure the evolution in the cluster comoving 
number density at a fixed mass threshold, e.g., by a factor of 5.0 ± 1.2 at M500 = 2.5 x 10^* M© between 
z = and 0.5. This evolution reflects the growth of density perturbations and can be used for the cosmological 
constraints complementing those from the distance-redshift relation. 

Subject headings: catalogs — galaxies: clusters: general — surveys — X-rays: galaxies 



1. INTRODUCTION 

This work continues a series of papers in which we present 
the data for a new X-ray selected sample of galaxy clusters — 
the 400d survey — based on the data fro m the J^OSAT PSPC 
pointed observations. In the first paper dBurenin et al.|200^ 
Paper I hereafter), we presented the cluster catalog and de- 
scribed the survey's statistical calibration (selection function, 
effective area and so on). A complete high-redshift subsample 
of the 400d clusters, 36 objects at z = 0.35 - 0.9 with (z) = 0.5, 
has been observed with Chandra. The goal of this program was 
to provide X-ray data of sufficient quahty for reliable estimates 
of the high-redshift (z ~ 0.5) cluster mass function. 

Chandra exposures were designed to yield at least 1500- 
2000 photons from each cluster. This is sufficient to measure 
several high-quality total mass proxies — average temperature 
excluding the center, integrated gas mass, and the Yx parame- 
ter (the product of tempeature and gas mass derived from X- 
ray data). The resulting mass estimates are much more reliable 
than what was achievable in many previous stu dies where the 
only available mass indicator was the X-ray flux Kravtsov et al. 
(j2006 1. Using several mass proxies also allows us to control the 
systematics by checking the consistency of results obtained by 
different methods. 

Observations of the high-redshift 400d clusters are comple- 
mented by Chandra archival data for a complete, flux-limited 
sample of nearby clusters detected in the ROSAT All- Sky sur- 
vey (49 objects at present, expected to grow by a factor of 1.5 in 
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the near future as the completeness of the Chandra archive ex- 
pands to lower fluxes). Chandra data for nearby clusters, com- 
bined when necessary with the ROSAT PSPC pointings, allow 
us to measure the same set of total mass proxies in local and 
distant clusters. 

The present work is a significant step forward in provid- 
ing observational foundations for cosmological work with the 
cluster mass function. First, it uses a larger sample of high-z 
clusters than the previous studies. For example, the best pub- 
lished measur ement o f the evolution in the cluster tempera- 
ture function ( Henry 2004) was based on 25 low-z objects and 
19 clusters with (z) = 0.43. Chandra provides much higher- 
quality data for each high-z object than were available before. 
Second, we use a more advanced approach to the X-ray data 
analysis, partly because this is called for by the Chandra data 
and partly because of the experien ce learned from recent deep 
observations of low-z clusters (e.g., VikhLinin et al.|2006) l. Last 
but not least, the data for high and low-z samples were ob- 
tained with the same instrument and analyzed uniformly, min- 
imizing the potential for systematic errors — the crucial ingre- 
dient for precise measurement of the evolution of the cluster 
mass function. 

In this paper, we present the analysis of the Chandra ob- 
servations of our cluster sample, describe our approach to the 
cluster total mass estimates, derive the evolving M - Lx rela- 
tion, and describe the computation of the survey volume as a 
function of mass. We conclude by presenting the cluster mass 
functions estimated in the "concordant" ACDM cosmology. 
The cosmological modeling of the cluster mass function data 
is presented in an accompanying paper (Vikhlinin et al.. Pa- 
per III hereafter). The prime goal of this work is to provide a 
full reference of the data reduction procedures and discuss the 
sources of systematic uncertainties in the cluster mass function 
estimates at low and high redshifts. 

All distance-dependent quantities are computed assuming a 
ACDM cosmological model with CIm = 0.30, and Ha = 0.70. 
We also assume h = 0.72, unless the expHcit h-scahng is given. 
The luminosities and fluxes are in the 0.5-2 keV energy band. 
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Fig. 1. — The limiting ROSAT flux for selection in the AOOd-Chandra sample, 
as a function of redshift. At z > 0.473, the limiting flux is 1.4 x 10"'^, that 
in the 400d catalog. At 0.35 < z < 0.473, the flux limit corresponds to the 
minimum luminosity specified in eq. [T] . 

2. CLUSTER SAMPLES 

2.L High-Redshift Sample 

Our high-redshift duster sample is a well-defined subsample 
of the z > 0.35 clusters from the 400d survey. The selection was 
designed to provide a quasi mass-Umited sample at z < 0.5 
by requiring that the i^OSAT-derived luminosity was above a 
threshold of 



4.8 X lO"*^ (1 + ergs' 



(1) 



in the default ACDM cosmology. This luminosity threshold 
approximately corresponds to a mass Umit of 10^* M© from 
the low-z Lx-M relation. The redshift factor here corresponds 
to an early measurement of the evolution in the Mgas - L rela- 
tion l |Vikhlinin et al.|20^2| . The resulting selection is entirely 
objective and in tact is torrnulated as a redshift-dependent flux 
Umit (shown in Fig.[TJ. At z > 0.473, no additional selection 
is appUed since the minimum flux of the main 400d sample 
satisfies the luminosity threshold in eq. lfT|l. 

Thirty nine objects from the 400d catalog satisfy these se- 
lection criteria, and all were observed with Chandra. For three 
clusters, 0216-1747, 0521-2530, 1117+1744, the accurate to- 
tal X-ray flux measured by Chandra was < 10'^'' erg s'^ cm'^, 
significantly below the target minimum flux in the 400d cat- 
alog, 1.4 X 10"'^ erg s"' cm'^. The existence of such devia- 



tions is expected (see, e.g.. Fig. 23 in Paper II because ROSAT 
flux estimates have large statistical errors. However, the com- 
putation of the 400d selection function in this flux regime is 
less accurate because it depends strongly on the wings of the 
distrib ution of the flux measurement scatter (see §7.1 in Pa^ 
per I| for details). We, therefore, opted not to use these three 
clusters in the further analysis. The additional selection crite- 
rion, /true > 10"'^ erg s'^ cm'^, will be taken into account in 
the sample volume computations. The final sample of 36 high- 
redshift clusters we will use hereafter is presented in Tablefl] 

2.2. Low-Redshift Sample 

The low-redshift cluster sample was selected, similarly to 
the procedure described in ,Voevodkin & Vikh linin, (2004 j . 



Fig. 2. — Redshift distribution of clusters in our low and high-z samples. 



from several samples based on th e ROSAT All-Sky Survey 
_(RASS) data (Ebeling et aL| |2000| — BCS; jd e Grandi et al. 
1999.,B6hringer et al 2004p - REFLEX;l Reiprich & Bohringer 
2002I — HlGFLUGcliJ. Overlaps between the catalogs were 



removed. The objects at Galactic latitude \b\ < 20°, as well 
as those around LMC, SMC, and the Virgo cluste r were ex- 
cluded (the exclusion regions were adopted from Reiprich &j 
jBohringer 2002). The total area covered by these catalogs is 
8J4 sr. The X-ray fluxes were remeasured (starting from a hst 
of objects with cataloged fluxes / > 5.3 x 10'^^ erg s'^ cm'^ 
in the 0.5-2 keV band), using the data from pointed ROSAT 
PSPC observations, when available. Our final sample con- 
sists of 49 clusters (Table |2} with the re-measured flux / > 
1.3 X 10"'^ erg s"' cm'^ in the 0.5-2 keV band, well above 
the sensitivity limit of all initial RASS cluster catalogs, and 
z > 0.025 (the lower redshift cut was used to ensure that a large 
fraction of the cluster virial radius fits inside the Chandra field 
of view). All objects in this sample have archival Chandra ob- 
servations, providing accurate X-ray spectral data. 

2.3. General Characteristics of the Cluster Samples 

The combined cluster sample is a unique, uniformly ob- 
served dataset. The volume coverage and effective mass limits 
in the low and high-redshift subsamples are similar {M-^^^ ~ 
(1 -2) X 10^"* Mq). The median mass at allredshifts is near 
M500 = 2.5 X 10'* h'^ M©, which corresponds to T = 4.5 keV 
clusters at z = 0. Observations suggest that clusters of such 
and larger mass exhibit scalings between their observables and 
mass close to the expectations of self-similar model (Nagaij 
et al.|2007a] l, which makes our sample particularly useful for 



cosmofogical applications. 

Our cluster sample is selected essentially using only the X- 
ray flux. Cluster detection efficiency is, in principle, also de- 
pends on the object surface brightness. However, the surface 
brightness effects are minimal for our objects. For the low- 
z sample, this is achieved by selecting objects with fluxes a 
factor of > 5 higher than the detection threshold in the par- 
ent ROSAT All-Sky Survey samples. For the 400d clusters, we 
used a highly sensitive detection method tailored for finding 
extended sources. The resulting sensitivity of the detection ef- 
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TABLE 1 

HiGH-REDSHIFT CLUSTER SAMPLE 



Name 


z 


Lx 




Tx 


My 


Mg 


Mr 




fROSAT^ 


Merger? 






ergs- 


-1 


keV 


10'" M0 


10'" Mq 


10'" Mq 


10-^^ cgs 


10"^^ cgs 




(1) 


(2) 


(3) 




(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 




3501 


5 24 X 




4 78 ± 75 


79 -L "^R 

J. / Zi ± U.JO 


'I SR -1- 9R 
J.JO It u.z,o 


'I 9(^ _L 77 

J.ZU ZlZ \J. / / 


15 34 


15 9+19 




1 91 9-1-97^^ 




3.61 X 


q44 


6 69 -(- RQ 


6 1 7 4- 57 

KJ. 1. / ^ U.J/ 


5 69 -1- 37 

J.UZ ^ U.J/ 


6 16-1-1 94 


10.53 


12.5 +1.7 




o^so— ■^Jini 


3631 


6.80 X 


lO" 


9 4S 4- 50 


1.43 ± 0.19 


1 40 -(- 1 R 

i.TU ^ u.io 


1.34 it 0.41 


1.68 


2.9 + 0.8 






3700 


1 82 X 


io« 


4 04 4- (1^ 


9 Q9 _l_ 9R 

Z..OjL III u.z,o 


9 44 4- 9 1 

Z . TTTT It U . Z. 1 


9 R(i 4- (i7 

Z.OU It U.U / 


4 63 


4 6 + 05 




01 sQ+nn'^n 


U.JOUU 


1 42 X 


q44 




2 51 it 37 


1 Q9 4- 99 

L . 7Z It U.Z.Z. 


9 f;7 4_ QO 

Z.U / It U.7U 


3 30 


3 3 + 04 




0QSR-(-47n9 


3900 


1.04 X 


q44 


3.57 it 0.73 


1 84 -(- 95 


1.34 it 0.15 


9 03 -(- 63 

Z.UJ ^ U.UJ 


2.22 


2.8 ± 0.6 




0S0Q-(-9Sl 1 


3990 


2.50 X 


[q44 


4.17 it 0.73 




^ QQ _i_ 35 

J.70 ^ U.JJ 


9 Qf; _(_ 7R 

Z.7U ^ u./o 


5.40 


5.5 ± 0.8 






4000 

U.t:UUU 


1 94 X 


lO"" 


^ 9fi -4- 4(1 


9 c;9 _L 94 

Z.. JZ. It U.Z,T: 


■X 1 94 

J. 1 U It U.Z.T: 


1 7(1 4- ^7 

1 . / U It U.J/ 


4 01 


4 + 05 




1 ^1 9-L'iQnn 


4037 


1 37 X 


q44 




9 7=; _(_ ^7 

Z.. / J It U. J / 


9 (i9 -t- 49 

Z.UZ It U.T:Z, 


9 47 4- 1 Ofi 

Z.4:/ It 1 .UU 


2.71 


2.6 ±04 




100S-(-^9S^ 


4161 


1.53 X 


q44 


5.44 it 1.40 


9 80 + 4Q 

Z.OU ^ U.t:7 


1 S7 -1- 90 

J.. J/ ^ U.Z-U 


3 R3 -(- 1 47 

J.OJ ^ ±.t:/ 


3.04 


3.5 ± 0.4 




01 41 — 


4423 


1 32 X 


g44 


9 1 -(- 

Z.. i. J i U.JO 


1 99 _L 17 

J. .ZZ It U. 1 / 


1 "^0 94 

J !>U It U.Z.T: 


1 0"^ 4- 97 
J. .u J It u.z. / 


2 06 


3 1 ± 9 




1 701 +641 4 


4530 


2 39 X 


q44 


4 ^fi -1- 4(1 


^ 9R + 94 

J .ZO It U.Z.T: 


^ 90 90 
J.ZU It u.z,u 


9 (^f; 4_ 49 

Z.UU It U.rrZ, 


3 91 


3.9 ±04 




1 641-^4001 


4(^40 


9 46 X 


o« 


-51 -I. 69 


1 70 + 90 
1 . / U It u.z,u 


1 34 it 13 


1 7^ 4Q 

i. . / J It xj.^y 


1 43 


2 9 ± 8 




OS99— ^694 


4720 


1.04 X 


q44 


^ 46 4- 48 


9 18-1-0 91 


1 R9 4- 1 5 

±.OZ It U.XJ 


9 1 9 4- 45 

Z.XZ It U.rrJ 


1.47 


1.8 ± 0.3 




1 9994^97nQ 


4720 


9 88 X 


q43 


^ 74 4- (1 1 

J . / ^ It U.U 1 


9 OQ 94 

Z.U7 It U.Z,t: 


1 59 it 16 


9 OR 4- 'il 

Z.UO ± U.JJ. 


1.39 


19 + 04 




O^SS— ^741 


4730 


1 76 X 


q44 


4 61 + S9 


09 

J.UZ It U.JJ 


9 44 4- 99 

Z.TTTT i U.Z.Z. 


9 R7 4- 76 

Z.O / ^ U. / u 


2 48 


2 9 + 07 




0KS^4-=i7^Q 


4750 


8 43 X 


o« 


^ 49 4- (i7 

It U.U / 


9 0=i 97 

Z..U J It u.z. / 


1 63 it 17 


9 OQ 4- /il 

Z.U7 i U.UJ. 


1 22 


2 + 05 




OSSS— 94=^6 


4751 


9.79 X 


q43 


^ 1 6 -(- 58 

J.i.U ^ U.JO 


1 QO -(- 99 


1.64 it 0.17 


1 R5 4- 51 

±.OJ It U.Jl 


1.33 


2.4 + 0.5 




0Q9f^4- 1 949 


4RQ0 


1 50 X 


g44 


4 74 4- 7 1 
rt. / rt in U. / 1 


■3 00 "^0 

J.UU It U.JU 


9 09 -^ 1 /i 

Z.UZ It U. lU 


3 42 it 77 


2 04 


17 + 03 




00^0+961 S 


=iOOO 

U.JUUU 


1 57 X 


q44 


5.63 it 1 13 


4^ 4- 41 

J.TiJ It U.Trl 


9 04 4- 1 Q 

Z.Utt ^ U. 1 7 


4 41 it 1 33 


2.09 


2 4 + 03 




1009+fiSSS 


SOOO 


1.71 X 


q44 


4 04 + 83 

t.Ut hi u.oj 


9 80 -(- 40 

Z . O U ^ U . ttU 


9 34 4- 97 

z.jt hi u.Zri/ 


9 65 4- Rl 

Z.UJ It u.ox 


2.19 


2.0 ± 0.4 




1 S94-(-nQS7 


5160 


2.07 X 


[q44 


4.23 it 0.51 


3 94 -(- 97 


3 OR -f- 91 

J.UO It u.z<± 


9 R9 -f- 51 

Z.OZ It U.JX 


2.45 


3.0 + 0.4 




1 ^S7+fi9'^9 


5250 


1 63 X 


q44 


4 (iO 4- fiQ 


9 Qfi 4- 9Q 

Z.7U It U.Z,!7 


9 40 4- 1 R 

Z.TrU i U. J. O 


9 7R 4- /i9 
z. / o ± u.uz. 


1.90 


2 + 03 




1 ^S4— 099 1 


5460 


1 40 X 


g44 


3 77 it 53 


9 -il _L 9^ 
Z.J J. It U.Z J 


1 (lQ 4- 1 
i. .U7 It U. lU 


9 39 4. 4R 

Z.JZ It U.rrO 


1 45 


15 + 02 




1 1 90-1-9^96 


5620 


1.79 X 


q44 


3 58 -(- 44 

J.JO ^ u . 


9 50 -(- 91 

Z.JU ^ u.zx 


9 39 4- 16 

Z.JZf It u.xu 


2.13 it 0.39 


1.68 


2.1 + 0.4 




0QSfi-(-41 07 


5870 


1.85 X 


[q44 


4 40 4- 50 

Tt.TU HI U.JU 


9 _(_ 99 

Z.7J ^ U.ZZri 


2.44 ± 0.14 


9 R7 -1- 4Q 

Z.O/ ^ U.t:7 


1.64 


1.6 + 0.3 




0328-2140 


0.5901 


2.30 X 


1044 


5.14 ± 1.47 


3.42 ± 0.66 


2.92 + 0.38 


3.17+ 1.36 


2.09 


2.1 + 0.6 




1120+4318 


0.6000 


3.75 X 


10« 


4.99 ±0.30 


3.92 + 0.17 


4.20 + 0.24 


3.00 + 0.27 


3.24 


3.0 + 0.3 




1334+5031 


0.6200 


2.22 X 


10"" 


4.31 ±0.28 


2.62 + 0.17 


1.88 + 0.22 


2.73 + 0.27 


1.76 


1.8 ±0.3 




0542-4100 


0.6420 


2.91 X 


10"" 


5.45 ± 0.77 


4.07 + 0.39 


3.70 + 0.25 


3.86 + 0.82 


2.21 


2.2 + 0.3 


■/ 


1202+5751 


0.6775 


2.22 X 


10"" 


4.08 + 0.72 


2.90 + 0.37 


2.85 ± 0.29 


2.42 ± 0.64 


1.34 


1.5 + 0.4 


■/ 


0405-4100 


0.6861 


2.23 X 


10"" 


3.98 ± 0.48 


2.51 + 0.20 


2.17 + 0.16 


2.32 + 0.42 


1.33 


1.5 + 0.4 


■/ 


1221+4918 


0.7000 


3.35 X 


10"" 


6.63 ± 0.75 


4.88 + 0.38 


4.16 + 0.23 


5.04 + 0.86 


2.06 


2.1 + 0.5 




0230+1836 


0.7990 


2.55 X 


10"" 


5.50 ± 1.02 


3.46 + 0.46 


2.70 + 0.27 


3.57 + 0.99 


1.09 


2.2 + 0.6 


■/ 


0152-1358 


0.8325 


5.46 X 


10"" 


5.40 ± 0.97 


3.91 + 0.52 


3.94 + 0.40 


3.40 + 0.91 


2.24 


1.8 ±0.3 


■/ 


1226+3332 


0.8880 


8.42 X 


10"" 


11.08+ 1.39 


7.59 + 0.61 


5.75 + 0.28 


9.91 + 1.86 


3.27 


2.9 ±0.3 


■/ 



Note. — Column (2) — cluster redshift. Column (3) — total X-ray luminosity (0.5-2 keV band, object frame) measured from accurate 
Chandra flux. Chandra fluxes and luminosities have ~ 2% statistical uncertainties. Column (4) — ave rage temperature from the spectrum 
integrated in the [0.15 - Ij rsoo annulus. Column (5) — total mass estimated from Yx parameter (§ |4.3| . Column (6) — Mtot estimated from 
integrated gas mass ( §|4.2| . Column (7) — mass estimated from the Mtot - Tx relation ( §|4.1[ . Column (8) — total X-ray flux measured by 
Chandra (0.5-2 keV, observer's frame). Column (9) — total X-ray flux (0.5-2 keV, observer s tram e) reported in the 400d catalog from ROSAT 
PSPC data. Column (10) — approximate classification into mergers and relaxed clusters (§|4.1.3[. 



ficiency to the cluster angular size has been extensively studied 
([Paper I) and found to be small. Furthermore, optical identi- 
fications also played no role in selecting the sample — essen- 
tially all X-ray candidates at both low and high redshifts were 
identified as galaxy clusters. Therefore, we do not miss objects 
because of misclassification caused by the presence of central 
or background AGNs. 

The redshift histograms for the low and high-redshift sam- 
ples are shown in Fig. [5] The depth of the low-redshift sample 
is z ~ 0.15; there are only 3 clusters beyond this z. Therefore, 
the low-redshift sample is effectively "local" and it gives us a 
snapshot of the cluster population at z « 0. The high-redshift 
sample starts at z = 0.35 and extends to z = 0.9. The median 
redshift of the distant sample is (z) = 0.5. 

Data of sufficient quality are available for utilizing three dif- 
ferent X-ray total mass proxies for all our clusters. These ob- 
servations provide us with a reliable measure of the evolution 
of the cluster mass function between z r* 0.5 and 0, or over 
« 37% of the present age of the Universe. 



3. CHANDRA AND ROSAT DATA REDUCTION 

Chandra observations provide the basis for our X-ray anal- 
ysis of both high and low-redshift clusters. We also make use 
of the ROSAT PSPC data for the low-z objects (pointed obser- 
vations when available and All- Sky Survey data for 8 objects). 
In low-z clusters, the statistical accuracy of the X-ray surface 
brightness determination at large radii is hmited mostly by 
the Chandra field of view. The analysis in such cases bene- 
fits from using the ROSAT data that cover a much larger re- 
gion although with a lower sensitivity. Below, we discuss the 
issues related to the initial data preparation, spectral analysis, 
and producing the "calibrated" X-ray images. How these data 
are used to derive th e basic I CM parameters and the cluster 
Mtot is discussed in §|33][3!4] and|4] 

3.1. Initial Data Reduction and Calibration Uncertainties 
3.1.1. Chandra 

For Chandra, our data reduction procedure is adopted with 
no changes from ^Vikhlinin et al.^(2005- V05 hereafter). This 
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TABLE 2 

LOW-REDSHIFT SAMPLE 



Name 






Lx 




Tx 


My 


Mg 


Mr 


Merger? 




10"^^ cgs 




ergs- 


-1 


keV 


10" Mq 


10" M0 


10" M0 




(1) 


(2) 


(3) 


(4) 




(5) 


(6) 


(7) 


(8) 


(9) 


A 3571 


7.42 




2 37 X 


lO"'* 




S QO 4- 0(1 
J.i/U It u.uu 


S "^0 4- 07 

J. JU It U.U / 


fil 4-0 IS 
U.UJ. Hi U. 1 J 




A2199 


6 43 


0.0304 


1.27 X 


()44 


3.99 ± 0.10 


? 77 + 05 


? so 4- 04 

Z<.OU I^ U.Ut: 


? Q? 4- 11 

Z.7Z HI U.XX 






6 24 


0.0346 


1.60 X 


IqU 


3 43 4- 10 


? 33 4- 05 

JJ HI U.UJ 


? S3 4- OS 

Zi.JJ I^ U.UJ 


? 3? 4- 1 1 

Z.JZ ^ u.±± 




A 496 


5 33 


0328 


1 23 X 


lO"'* 


4. 1 ? 4- 07 


? Q^i 4- 04 

Z, . 1/ U It U . Ut: 


^ 0? 4- 04 
J.UZ Hi U.Utt 


■3 07 4- 07 

J.U / It U.U / 




A 3667 


4 64 


0557 


3 14 X 




_L 0^> 


7 4- 07 

/ . J J It U.U / 


S fi? 4- IS 
O.UZ Hi U. 1 J 


74 4- OQ 

\j. / rt It U.U!? 




A 754 


4 35 


0.0542 


2.78 X 


qU 


8 73 -(- 00 
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Note. — Columns (3)-(9) have the same meaning as in TablelTl Column (2) gives the total flux (0.5-2 keV) f^rom the best 
source available {Chandra if cluster the cluster is at sufficiently hign redshifl; to fit the field of view, J^OSAT PSPC pointing, and 
re-measurement from the All-Sky survey data as a last resort). 

Redshifts were converted to the CMB reference frame. 



includes careful filtering for high background periods and ap- 
plying all the latest calibration corrections to the detected X- 
ray photons, and determination of the background intensity in 
each observation. 

The quiescent Chandra background is dominated by the 
events induced by charged particles. This component can 
be subtracted exquisitely ac curately (with a < 2% scatter, see 
[Hickox & Markevitch|2006] l. A much smaller contribution is 
provided by a fraction of the cosmic X-ray background not re- 
solved into discrete sources. This component is modeled ad- 



equately by using the "blank-sky" background datasets which 
include both the particle-induced and unresolved sky compo- 
nents. Finally, there is a non-negligible diffuse soft component 
attrib utable to the Galactic ISM emission (Markevitc h et al.| 
2003|l and in some ca ses, to the geocoronal charge exchange 
( Wargelin et al. '20041. The soft background component is the 
hardest to model because its intensity depends on the pointing 
direction, and can even be variable in the case of charge ex- 
change emission. Fortunately, the soft component can still be 
subtracted sufficiently accurately because it is separated spec- 
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Fig. 3. — Typical examples of X-ray images for the low-redshift clusters (A85, A2163, and A2597 top to bottom). Left panel show the Chandra images (each panel 
is 50' X 50'). ROSAT PSPC images (64' x 64') are shown on the right. Yellow circles show detected sources unrelated to the clusters; the general increase of their 
radius at l arge off- cluster distances reflects the degradation of the telescope PSF. The red c ircles indicate the cluster substructures that were removed from the profile 
analysis (§ |3.2[ . The red crosses mark the location of the adopted cluster centroid (§ |3.2[ . 
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trally from the cluster emission (sin ce it is dominated by emis- 
sion lines near 0.6 keV, see § 2.3.2 in |V05l l. 

Uncertainties in determining each of the background com- 
ponents were propagated in the further analysis. Their impact 
on the analysis of the Chandra cluster observations is exten- 
sively discussed in V05 Here we only note that this source 
of uncertainty is negligible for the measurements of the aver- 
age cluster temperatures dominated by the bright inner region; 
similarly, the gas mass measurements are based on the surface 
brightness profiles in the soft band where the background is 
lower relative to the cluster flux. 

Conversion of the observed X-ray fluxes to physical quanti- 
ties such as the temperature and density of the intracluster gas 
relies on accurate calibration of the spectral response. An ex- 
tensive pre-flight calibration program was designed to provide 
absolute calibration of the effective area of the mirror-hACIS 
system to within 2% at all locations and across the entire en- 
ergy band. The in-flight performance was degraded somewhat 
but by 2005-2006, the calibration accuracy was restored to 
near-preflight levels. Currently, the uncertainty in relative (po- 
sition and time-dependent) variations of the effective area is 
< 3% within the energy band we use in the present work'. The 
estimated uncertainties in the absolute effective area are < 5% 
at all energies. The systematic effect of such uncertainties on 
t he esti mated cluster mass function is small, as summarized in 
§ |8.1.1| We also note that the calibration uncertainties in the 
measurement of the evolution of the mass function are nearly 
canceled because we use the same telescope and uniform anal- 
ysis of both low- and high-z samples. 



3.1.2. ROSAT 
The ROSAT PSPC data were reduced as 



described in 
on 
are 



Vikhlinin et al. The reduction p ipeline was based 

S. Snowden's software i jSnowden et al.j |l994l. This softw 
eliminates periods of high particle and scattered solar back- 
grounds as well as those intervals when the detector may be 
unstable. Exposure maps in several energy bands are then cre- 
ated using detector maps obtained during the ROSAT All-Sky 
Survey. The exposure maps include vignetting and all detec- 
tor artifacts. The unvignetted particle background is estimated 
and subtracted from the data, even though the PSPC parti- 
cle background is low compared to the cosmic X-ray back- 
ground. The scattered solar X-ray background also should be 
subtracted separately, because, depending on the viewing an- 
gle, it can introduce a constant background gradient across the 
image. Most Solar X-rays were eUminated by simply excluding 
time intervals when this emission was high, but the remaining 
contribution was also modeled and subtracted. If the cluster 
was observed in several pointings, each pointing was reduced 
individually and the resulting images were merged. 

The energy resolution of the ROSAT PSPC is insufficient 
to separate the soft background components spectrally, which 
was possible in the case of Chandra. However, the ROSAT field 
of view is much larger and usually we can reliably measure 
the uniform background level from the cluster observations 
themselves. Our procedure for the background determination 
was to fit the observed surface brightness profile at large radii, 
r > 0.7 rsoo, to a po wer law plus constant model (as discussed 



m 



Vikhlinin et aI.|[T999| |. The additional power law compo- 



nent is required since at lower z, the contribution of the clus 

' The current status of the Chandra cahbra tion i s summarized on the 
WWW page http:/ / cxc.har vard.edu/cal| See also |V05| for discussion relevant 
to the cluster data analysis. 



ter brightness is small but non-negligible even near the edge of 
the ROSAT PSPC field of view. The tests show that this proce- 
dure provides a re lative uncertainty in t he background deter- 
mination of ~ 5% l|Vikhlinin et al. 1999| l. This uncertainty was 
propagated into the further analysis. 

The Hmited bandpass of the ROSAT PSPC (limited to £ < 
2 keV) does not allow one to measure the cluster temperatures 
with an accuracy useful for our purposes. However, the ob- 
served count rate can be converted to a broad-band flux very 
reliably, as confirmed directly by excellent agreement with the 
Chandra-deiived flux from the same region (see §|3.4|. 



3.2. Removal of Substructures and Identification of the Cluster 

Center 

After the initial data preparation, we have flat-fielded 
and background-subtracted images in the 0.7-2 keV energy 
band^". These images contain only the cluster emission and 
other X-ray sources. Our next step is to remove all point-like 
sources, as well as substructures within the cluster. The point 
source removal is the most straightforward step. Our detec- 
tion routine is based on the wavelet decomposition technique 
documented in Vikhlinin et al. ( 1998) . The point sources are 
identified using the small scales of the wavelet decomposition 
and the corresponding regions are masked out from all fiirther 
analysis. The exclusion radius takes into account the variation 
of the PSF size with the offaxis angle (this is especially impor- 
tant in the case of ROSAT PSPC pointed observations). 

We also mask out any detectable, well-defined substructures 
within the cluster (they are included only in the total X-ray lu- 
minosity). The detection of substructures was fully automatic 
and based on the analysis of large scales of the wavelet decom- 
position process. We masked out only the regions associated 
with the prominent secondary maxima in the X-ray surface 
brightness, keeping the weaker components such as filamen- 
tary structures. Examples are shown in Fig.[3]and[4] Removal 
of obvious substructure reduces the scatter in the relation be- 
tween the total mass and X-ray proxies, although the effect is 
small in most cases because we exclude only a small fraction 
(< 20%) of the total flux. We note that removal of substruc- 



tures w as included in the mock Chandra analysis ( Nagai et al. 
|2007b| l which we use to assess the uncertainties in the calibra- 
tions of the Mtot vs. proxy relations. 

The only quantity we measure without removing the large 
scale-substructures is the total X-ray luminosity. The luminos- 
ity determines the detectabihty of the cluster in shallow sur- 
veys. These surveys usually lack sensitivity and angular reso- 
lution to remove the substructure and detect the clusters on 
the basis of its total flux. 

The further steps in the X-ray data reduction are based on 
the analysis of the azimuthally averaged profiles. For this, we 
need to define the cluster center in each case. In the case of 
relaxed clusters with cooling flows, the center is defined to 
be simply at the location of the X-ray peak. The situation is 
less straightforward for the non-cooHng flow clusters or those 
with substructure. Instead of using the maximum in the X-ray 
brightness map, we center the profiles at the "center of gravity" 
for the main cluster body. This is done by computing the mean 
emission-weighted coordinates using the X-ray brightness in 
the annulus r = [250 - 500] kpc, and iterating this procedure 
2-3 times. The selection of the centroids is illustrated in Fig.[3] 

The 0.7-2 keV band is chosen to maximize the ratio of the cluster and 
typical background brightness. 
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3.3. Chandra Spectral Analysis 

There is an important difference in the approach for deter- 
mination of the average temperature from the Chandra data 
for high- and low-z clusters. The procedure is straightforward 
for the high-z objects that fall entirely inside the Chandra field 
of view. In this case, we can measure the average temperature 
simply by fitting a single- T model to the X-ray spectrum in the 
0.6-10 keV band integrated in the radial range of interest, e.g., 
r = (0.15-1) rsoo. This is a common, straightforward analysis; 
the interes ted reader can find all the details of our approach in 
IVikhlinin e t al. (20051. 

The situation is more complicated for low-z clusters where 
typically not all position angles fall inside the ACIS field of 
view at large radii (Fig.Jsl. If we simply fit the integrated spec- 
trum within the ACIS field of view, the contribution of the cen- 
tral region to the total flux will be higher than it should be in 
the case of complete coverage. This introduces a bias if the ICM 
temperature distribution is not uniform. Usually, T is over- 
estimated becau se the ob served T(r) de creases at large radii 
dMarkevitch et a l. 1998 De Grandi & Mo lendi 2002 ; Vikhhnin 
|et aL|2005| [Pratt et al. 20071. Our solution is to measure the 
temperatures independently in several annuli (we use annuli 
of equal logarithmic width, rout/^in = 1-5, within which the 
overall gradient of T{r) can be neglected) and then average 
the obtained temperature profile weighting each bin not with 
observed counts but with the total flux expected in the given 
annulus if it were completely covered with the field of view. 
The surface brightness profile needed to compute this weight- 
ing function can always be derived from ROSAT data that al- 
ways cover the radial range of interest. In principle this is not 
an exact method since this weighting function is proportional 
essentially to the emission measure integral, while the weight- 
ing corresponding to the spectroscopic mean is different ( jMaz-| 
[zotta et al. 2004 Vikhhnin 2006). In practice, however, this 
makes a negUgible difference f or our clusters, as was verified 
using the clusters from the Vikhlinin et al." I 2005| l sample that 
have the adequate radial coverage tor exact computation of 
(T). 

The X-ray spectral model we fit to the observed Chandra 
spectra includes foreground absorption in the Galactic ISM. In 



most cases, the absorbing column density, Nh, was fixed at the 
value provided by radio surveys (Dickey & Lockman^l990 ) but 
we always checked that it is consistent with the observed spec- 
trum. In a few cases (2A 0335, A2634, A478, A2390) the X- 
ray spectrum indicated a significantly higher absorption than 
suggested by the radio data, most likely due to the presence of 
molecular gas and dust along the hne of sight. In these cases, 
Nfj was derived directly from the X-ray spectrum. A caution- 
ary note is that small variations of Nh, of order ±2 x 10^° cm"^, 
cannot be detected in the Chandra spectra'^ because they are 
indistinguishable from variations of the temperature. For the 
typical values Nh = 4 x 10^° cm"^ and T = 5 keV, the variation 
of Nh by ±2 x 10^° cm"^ changes the best-fit temperature by 
±7%, and also changes the derived gas mass by ±3.5%, anticor- 
related with T. Such variations are smaller than the scatter of 



these quantities for a fixed mass ( Kravtsov, Vikhlinin & Nagai 
2006 1 but still should be kept in mind. In this regard we note 



that Yx = Mgas X Tjf is less sensitive to variations of Nh because 
they have the opposite effect on Tx and Mgas. 

The last issue that should be discussed in relation with the X- 
ray spectral analysis is the treatment of the ICM metallicity. In 
low-z clusters, the statistical quahty is sufficient to measure the 
metal abundance simultaneously with the temperature. This is 
impossible for most of our high-z clusters. In these cases, we 
fixed the metalhcity at Z = 0.3 Zp, the typical value at both 



low and hig h redshifts ( [Mushotzky & Loewenstein 1997 Tozzi 
et al. 2003 1. We verified that variations ot Z in the range 0.1 



0.5 (conservative bracket) have a small effect on the derived 
parameters — for a T = 5 keV cluster at z = 0.5 the temperature 
changes by ±5% and Mgas changes by ±2%, correlated with T. 

The instrumental uncertainties in Tx measurement are sys- 
tematic and uniform (do not introduce object-to-obejct scatter 
or any significant redshift-dependent trends). They are consid- 



ered separately in § 8.1.1 



3.4. Gas Mass Measurements 

" Nor in the combined Chandra & ROSAT spectrum in the 0.2-10 keV 
band if the nominal N[f is greater than approximately 5 x 10^° cm"^. We note, 
however, that the ROSAT data were checked for consistency with the nominal 
Nh for all clusters in our low-redshift sample. 
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r, kpc r, kpc 

Fig. 5. — Examples of the surfa ce b rightness profile modeling for clusters shown in Fig.|3]and[4] The observed X-ray count rates are converted to the projected 
emission measure integral (see § |3.4| and V06I. The black and red data points show the Cnandra and ROSAT measurements, respectively. The best fit models (the 
projected emission measure integral for ttie tliree-dimensional distribution given by eq.|2j are shown by solid lines. The dashed lines indicate the estimated rsoo 
radii (the Ix based value, see §|4]i, for reference. Note that in all cases, the surface brightness is traced accurately to rsoo. In relaxed clusters such as A85, the model 
describes the data very accurateTy. In strong merge rs su ch as A2163, we see systematic deviations from the fit. The effect of such deviations on the cluster mass 
proxies was studied in Nagai et al. ( 2007b see also §|3.5|. 



Two of the mass proxies we utilize for the Mtot estimates though the effects of the temperature and metalUcity are very 

(§[4]l use the gas mass within r < rsoo- Derivation of the gas weak, we appUed the appropriate corrections to the observed 

mass from the X-ray imaging data is relatively straightforward, surface brightness profiles as detailed in V06 this correction 

but a few points are still worth noting here. Our procedure for also removes the effects of spatial variations of the telescope 

the Mgas measurements follows that used for a more detailed effective area. The corrected profiles are expressed in units of 

an alysis of a smaller samp le of low-redshift clusters described the projected emission measure integral, / rig tip dl. They are 

inlVikhlinin et al.|(|2006l V06 hereafter), and the main steps deprojected to reconstruct the 3-dimensional profile of Pg{r). 

This is done by fitting the projected data to an analytical model. 



are outlined here tor completeness. 

The X-ray flux in the 0.7-2 keV energy band is ve ry insensi- 
tive to the plasma temperature, as long as T > 2 keV (Fabricant 
et al. 1980 1. The observed brightness gives essentially the inte- 
gral of p't along the Une of sight. This is why the ICM mass 
is robustly derived from the X-ray data even if the detector 
has almost no energy resolution and a limited bandpass. Even 



(r/r,)- 



(1 +r2/r2)3/'-«/2 (l + ry/rsyy/y 
nil 



(2) 
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that represents a ll main features observed in r eal dust ers 
the j8-m odel (Cavaliere & Fusco-Femiano 1978 1 profile l |Jones 



&Forman|1984| that may steepen at large radii, and also show 



a power-law cusp and possibly a separate component in the 
center. These modifications of the j8-model greatly enhance 
the functional freedom and improve the reUability of the X- 
ray modeling at large radii (see discussion in V06 1. 

The parameters of the 3-dimensional model l|2} are obtained 
by numerically projecting it along the line of sight and fitting 
to the observed profile. The best fit directly gives us the an- 
alytic expression for the 3-dimensional profile of Pg{r) which 
can be integrated to determine Mg^s in the given range of radii. 
Several examples of this analysis are shown in Fig.js] Note the 
excellent agreement between the Chandra and RObAT mea- 
surements in the same regions indicating an accurate cross- 
calibration between the two instruments. The uncertainties of 
Pg(r ) and Mg^s are derived via Monte-Carlo simulations (see 

3.5. Verification by Mock Observations of 
Cosmological Simulations 

We note that our approach to the measurements of the ICM 
mass and average temperature has been fully tested by the anal- 
ysis of the mock Chandra observations of the clusters from 
high-resolution cosmological simulations ( [Nagai et aT 2007b'l . 
The cosmological cluster simulations used in this worJc should 
correctly reproduce the main aspects of the ICM structure in 
real clusters, including the large-scale deviations of the main 
cluster body from spherical symmetry and intermediate-scale 
nonuniformities of the ICM density and temperature. In fact, 
the simulations reproduce the detailed X-ray properties of the 
ICM in the cluster outskirts (r > 0.2r5oo) quite well (Nagai 



et al. 2007ajl and are therefore sufficiently realistic for our pur- 



poses. In constructing the mock observations of these simu- 
lations, we carefully reproduced the essential observational ef- 
fects such as the Chandra sensitivity to plasma of different tem- 
peratures, the background level and photon statistics found in 
typical observations for both low and high-redshift clusters. 
The mock data were reduced by the same software that we use 
for the analysis of real cluster observations. 

The mock data analysis thus tests the combined effect of in- 
accuracies in all steps of our analysis, including removal of 
substructures, temperature measurements, and modeling of 
the X-ray brightness profile. The mock analysis shows that we 
recover Mgas and average temperatures very accurately. For ex- 
ample, the bias in Mgas within r = r^oo due to small-scale non- 
uniformities of the ICM is only +3%, independent of redshift. 
The unrelaxed clusters are not significantly different from the 
relaxed ones, except for a small number of outliers where the 
Mgas measurement can be biased by 10-15%. This is signifi- 
cantly smaller than t he biase s reported in the earlier work by 
|Mathiesen, Evrard & Mohr|(fT999). The improvement can be 
explained by advances in the data analysis (in particular, the 
relaxation of the assumption that the ICM density follows the 
jS -model) and inclusion in our mock analysis of the effect of 
substructure removal which was always used by observers. 

To summarize, we can state that the results from the analysis 
of mock observations vaUdate our analysis methods. The ex- 
pected residual biases have almost no effect on the derivation 
of the cluster mass function. 

4. TOTAL MASS ESTIMATES 
The cluster mass is not a well-de fined quantity and can be 



to define mass within the radius corresponding to a fixed mean 
overdensity. A, with respect to the critical density at the cluster 
redshift, pc = 3H^{z)/8nG: 



M^ = M{< r^) 



Ma 



4/3 nrl 



A X p,. 



(3) 



The choice of the overdensity threshold is driven by practi- 
cal considerations. The ultimate goal of these measurements 
is to compare the observed mass function with the theoretical 
predictions. The mass functio n models, which are calibrated 
by numerical simulations (e.g., Jenkins et al.|2001j l, are more 
robust for low values of A, where the role of numerical reso- 
lution in the simulations and non-gravitational effects within 
clusters is minimal. On the contrary, the masses derived from 
X-ray data are more robust for high values of A, where the sta- 
tistical quaUty is higher, hydrostatic equilibrium assumption 
is more accurate, etc. We need, therefore, to choose a com- 
promise between conflicting theoretical and observational re- 
quirements. We choose A = 500 — the radius within which 



the clusters are relatively relaxed (Evrard et al. 1996 > and good 
measurements of gas mass and tempera ture can b e obtained 
with our Chandra observations (Vikhlini n et al.| 2006 Naga^ 
et al. 2007b I. This is, effectively, the largest radius at which the 
ICM temperature can be reliably measured wit h Chandra and 
XMM-Newton (e.g. Vikhlinin et al. 2005 Pratt et al.|2007| . Us- 
ing significantly lower A dramatically increases observational 
uncertainties; at significantly higher values of A, the theoreti- 
cal uncertainties start to increase while there is no crucial gain 
on the observational side. 

The total cluster masses, M500, are estimated from observed 
ICM param eters. We employ the three X-ray proxies for Mtot 
discussed in jKravtsov et al.| ( |2006| KVN hereafter) — the core- 
excised average temperature, Tx; the hot gas mass, Mgas; and 
the estimated total thermal energy, Yx = Tx ^ Mgas. We rely 
on the existence of low-scatter scaling relations between these 
parameters and Mtot, as predicted by self-similar theory and 
confirmed by high-resolution cosmological simulations. 

The mass vs. proxy relations are calibrated using the hy- 
drostatic Mtot estimates in a sample of well-observed, low- 
redshift, relaxed clusters, 10 clusters from V06 plus seven ad- 
ditional objects (A2717, A31 12, A1835, A1650, A2107, A4059, 
RXJ 1504-0248) whose deep Chandra observations appeared 
in the archive since 2006 . In principle, the hydrostatic 
method can underestimate the total mass due to non-thermal 
pressure components. For example, the a nalysis of mock ob- 
servations presented in|Nagai et al.|( |200 7b I suggests that M500 
can be underestimated by ~ 13%, and this effect can be at- 
tributed to the bulk motions of the gas at large radii. We do not 
correct the normaUzation of the mass vs. proxy relation for any 
such effects because there are theoretical uncertainties in their 
magnitudes (e.g., the ICM viscosity can affect the average ve- 
locity of small-scale bulk motions). We simply account for the 
possible Mtot biases in the total systematic error budget (see §[8] 
below). Ultimately, a reUable calibration of the mass vs. proxy 
relation can be obtain ed through a stacked weak lensing analy- 
sis (e.g., |Sheldon et al. 2001 1 of a representative sample of clus- 
ters with high-quality X- ray da ta. Such data are only starting 
to become availab le now ( [Hoek stra 2007 Mahdavi et al. 200^ 
Zhang et al.|2008 1 and we in fact use them to place limits on 



ity ; 

defined in a variety of ways (see, e.g.. White 2001) . We choose 



Thes e data were reduced completely identically to Vikhlinin et al. 1 20051 
and |V06| AH primary conclusions of these papers hold tor these additional 
objects. The only effect is to improve the accuracy of the Mtot vs. proxy rela- 
tions. 
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sy stema tic errors in our calibration of the Mjot measurements 
(§|431t. 

We do apply, however, small first-order corrections to the 
observed mass vs. proxy relations when they are required 
to transfer the calibration from relaxed clusters to the entire 
population or to account for expected departures from self- 
similarity in the evolution of these relations. In doing this, we 
try to use only the most robust predictions from the simula- 
tions and to rely on the directly observed properties as much 
as possible. The corrections to each proxy are detailed below. 
The largest corrections are applied for the Mjot - Tx relation, 
while the M,ot - Yx relation does not require any corrections 
(and hence is potentially the most reliable). 



1.10 



4.1. M„ 



Ty Relation 



The average X-ray temperature is one of the most widely 
used cluster mass indicators. The M - T relation expected in 
self-similar theory is given by 

Msoo oc T'/^ E{zy\ where E{z) = H{z)/Ho. (4) 

This relation arises in a self-similar model simply because the 
ICM temperature is expected to scale with the depth of gravita- 
tional potential T oc M/R and mass and radius in our adopted 
definition are related {R oc M^^^). The relation (ml also gener- 
ally describes the ICM temperatures found in the cosmological 
numerical simulations (Evrard et al..l996,,Mathiesen & Evrard| 



200T||Borgani et aI.|2004||KVN| l. 

Tne average cluster temperature can be defined in different 
ways but the most practical, from the observational point of 
view, is the average spectral temperature — the value derived 
from a single temperature fit to the total cluster spectrum inte- 
grated within a given radial range. We refer to this temperature 
as Tx. 

4.1.1. Definition and Determination of Tx 

Spatially-resolved X-ray spectroscopy became available with 
the launch of the ASCA satellite, and since then many studies 
has indicated that the cluster scaling relations become tighter if 
the average temperature is measured excluding the cluster cen- 
teral region which is often affected by radiative cooUng. This is 
well illustrated by the re duction in scatter i n the Lx - T rela- 
tion shown in Fig.1-2 of Markevitch ( 1998 k The temperature 
profiles show a large object-to-object scatter in the centers of 
even relaxed clusters ( V06). Clearly, the central cl uster region 
should be excluded from the measurement of Tx- |Markevitch| 
( [1998 1 has us ed Tin = 70 kpc (this inner cutoff radius was also 



used in V06 and several other works). Perhaps a bett er mo- 
tivated cho ice is to set r^n a t a fixe d fraction of rsoo (Arnaudl 



et al. 2005) . We will use, as in KVN rin = 0.15 rsoo, because ap 



proximately outside this radius the observed profiles of relaxed 
clusters are self-similar ( V06). Chandra's angular resolution is 
sufficient to resolve 0.15 rsoo even in the highest-redshift ob- 
jects. An algorithmic compHcation is that the cutoff radius is 
expressed through Mtot which is itself estimated from, e.g., Tx. 
This is not a big problem since Tx is not very sensitive to the 
exact value of rin, and hence the following iteration scheme 
converges quickly: (a) measure Tx including the central re- 
gion; (b) estimate mass from M - T relation; (c) re-measure 
Tx using rin = 0. 15 fsoo and estimate new mass; repeat step (c) 
until convergence is reached. 

We also need to address the issue of the outer radius for 
integration of the X-ray spectrum. The cluster properties 
seem to become progressively self-similar at large radii (Nagai,, 



1.00 



0.90 



0.80 




Tx,2. keV 



Fig. 6. — Ratio of the X-ray spectral temperatures measured in the radial 
ranges (0.15-0.5) rsoo ("7x,2 ") and (0.15-1) rsoo {"Tx"), for clusters in the 
local sample that have a sufficient Chandra coverage. The solid line shows the 
linear approximation given by eq.jsj, with a 3% level of scatter indicated by 
dotted lines. The ratio of the temperatures is also consistent with a constant 
value, ~ 0.95 except for a few outliers at low T. 



Kravtsov & Vikhlinm 2007a|). Therefore, ideally, the spectrum 
should be integrated as tar out as possible. The exact value 
of rout is unimportant because the total X-ray flux converges 
quickly at r ^ oo. A good practical choice is to set rout = ''soo. 
because outside approximately this radius, the X-ray bright- 
ness is low compared with the background (e.g., Fig.Bl. 

Given the arguments presented above, we determine Tx in 
the radial range 0.15 rsoo - 1 rsoo- This is a straightforward 
measurement for our high-z clusters because this region fits 
completely inside the Chandra field of view and exposures 
were designed to provide a sufficient statistical accuracy. How- 
ever, for a large fraction of the low-z clusters, integration to 
rsoo is impossible because of the limited field of view . A sim- 
ple investigation shows that we can use a smaller value of rout 
in such cases. First we note that the temperature can be inte- 
grated to 0.5 rsoo for all clusters (we refer to this value as 7x,2)- 
For clusters that have sufficient radial coverage, we measured 
temperatures both in (0.15 - 0.5) rsoo and (0.15 - 1) rsoo ra- 
dial ranges. The ratio of the two values is shown in Fig.[6] It is 
consistent with a linear relation. 



Tx/Tx,2 = 0.9075 + 0.00625 Txa, 



(5) 



where the temperatures are in units of keV. The observed scat- 
ter around the linear fit is neghgible, < 3%. The ratio is also 
consistent with a constant value, ~ 0.95, except for a few out- 
liers at low temperatures. Since a tight correlation is observed, 
we can measure 7x,2 and then estimate Tx with a sufficient ac- 
curacy using equation ([sjl for those clusters that are not cov- 
ered by Chandra at large radii. 

Finally, we note that even if the trend in TxjTx^i is real, this 
does not necessarily imply deviations from self-similarity. Be- 
cause T is not constant as a function of radius, we have a mix- 
ture of spectral components within any aperture. A single- 
temperature fit to such a spectrum gives a weighted average 

Note that the gas mass can still be measured in these clusters out to rsoo 
using the ROSAT PSPC data. 





5: 



10 



CHANDRA CLUSTER COSMOLOGY PROJECT. IL 
1 1-0| 



11 



if 



10 



Tx, keV 



Fig. 7. — Calibration of the M - T relation using X-ray hydrostatic mass 
measurements for a sample of 10 relaxed Chandra clusters with the tempera- 
ture profile measurements extending to r = rsoo . The mass measurements are 
taken from V06 with 7 additional clusters (see §|4), the temperatures match 
our definitionot Tx (see § |4.1 1| . The dashed line snows the best-fit power law 
relation (parameters given m lablejsj. 

which is different from the mass-weighted T and weight ing it- 
self de pends on the typical temperature in the spectrum (Maz- 
|zotta et al. 2004 Vikhlinin 20061. Therefore, we expect trends 
in the Tx/Tx^2 ratio even if the scaled 3-dimensional tempera- 
ture profiles for low and high- T clusters are identical. 



4.1.2. Calibration of Mto 



- Tx Relation at Low Redshifts using 
Relaxed Clusters 



For 17 low-redshift relaxed clusters, there exist very high- 
quality Chandra observations, providing temperature profiles 
extending sufficiently far to permit hydrostatic mass estimates 
at r = rsoo (see introduction to §|4]l. These observations are a 
basis of our calibration of the Mjot - Tx relation at low redshifts. 
The mass and temperature measurements for these 17 clusters 
(FiglT] note that we symmetrize the error bars for simplicity) 
are fit to the power law, 

(6) 



M= Mo£(z)"^ (T/5keV)", 



normalized at T = 5 keV because this is approximately the me- 
dian temperature for this sample and therefore the estimates 
for Ms and a should be uncorrecte d. The fit is performed us 



ing the bisector modification of the|Akritas & Bershady 



(1996 



and references therein) linear regression algorithm tnat al 
lows for intrinsic scatter and nonuniform measurement errors 
in both variables. The unc ertain ties were evaluated by boot- 
strap resampling (e.g., [Press et al.|1992| , while simultaneously 
adding random measurement errors to M and T. The results 
are shown in Fig.lTland the best-fit parameters of the power law 
fit are reported iriTable[3] The best-fit slope, 1 .53 ± 0.08 is con- 
sistent with the expectation of the self-similar theory (eq|4}. 
Fixing the power law slope at 1.5 does not significantly reduce 
the uncertainty in the normaUzation (Table |3}. The XMM- 
Newton determination of the M - T relation ( |Arnaud et al.| 
[2005| ) is close to our measurement. 

Our procedure for hydrostatic Mjot estimat es was fully 
tested u sing mock data from the simulations in Nagai et al. 
( 2007b| l. This work shows that the inaccuracies introduced by 
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Fig. 8. — The fraction of clusters which could be classified as "relaxed" based 
on their observed X -ray m orphology (presence of secondary peaks, large cen- 
troid shift etc., see § |4.1.3( , as a function of z. 

the X-ray data analysis — e.g., those related to departures of 
the cluster body from spherical symmetry — are small. The 
dominant source of error are departures from equilibrium and 
non-thermal pressure components — the effect fundamentally 
missed by the X-ray hydrostatic mass estimates. For example, 
the residual random gas motions in "relaxed" clusters in the 
Nagai et al. sample seem to result in a 10-20% underestima- 
tion of Mtot near r = rsoo. Unfortunately, direct measurements 
of the ICM turbulence (and other non-thermal pressure terms) 
presently are unavailable. We thus face a dilemma: should we 
use the theoretical modehng to estimate corrections to the X- 
ray mass estimates, or should we rely only on observations? 
Our choice is to foUow the philosophy outUned in the intro- 
duction to ^ and to use the corrections suggested by simu- 
lations as an estimate of the systematic errors. A better esti- 
mate (9%) for the systematic uncertainties in the Chandra clus- 
ter mass scale can be obtained from co mparis on of X-ray and 
weak lensing mass measurements, see § 4.3.1 below. 



4.1.3. Transfer of Mtot - Tx Calibration to Entire Population 

The simulations suggest a systematic offset in the normaliza- 
tion of the Mtot - Tx relation for relaxed and unrelaxed clusters, 
in the sense that the merging clusters tend to have lower tem- 
peratures for the same rn ass (jMathiesen & Evrard 200 Ij Ven- 
timiglia et al.|2008||KV Nl. Since our calibration of the MtoT^x 



is tor a subsample of relaxed clusters, we need a procedure to 
transfer this calibration for the entire population that contains 
both relaxed and merging clusters. This can be achieved using 
a simple, first-order correction outlined below. 

First, we note that the systematic offset in the Mtot ~ Tx re- 
lation cannot be measured directly using the X-ray data. Ul- 
timately, it can be measured with a weak lensing analysis of a 
large sample. The results presented in KVN (their Table 2) sug- 
gest that the offset is (17 ± 5)% in mass for a fixed Tx- There is 
no obvious trend of this offset with redshift, or the difference 
in the slope of the relations for relaxed and merging clusters. 
Most importantly for our application, this offset can lead to 
departures from self-similar evolution in the Mtot - Tx rela- 
tion for the entire cluster population, because the fraction of 
merging clusters is expected to increase at high redshifts (e.g. 
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TABLE 3 

Calibration of mass-observable relations 



Relation Form Mo,fg,a « 



Msoo-Tx M500 = Mo(r/5keV)''£(z)"i (3.02 ± 0.11) x lO'" /i"' Mq 1.53 ±0.08 

Msoo-Tx M500 = Mo(T/5keV)''£(z)-i (2.95 ± 0.10) x lo'" /i"' M0 1.5, fixed 

Msoo-Mgas fg = fg,o + alogMis (0.0764 ± 0.004) /i"'-^ 0.037 ± 0.006 

Msoo-Yx Mr.oo^Mo{Yx/3xW^*MekeV)''E{z)-^/^ (5.77 ± 0.20) x lo'" /j'/^ Mq 0.57 ± 0.03 

Msoo-ix M500 = Mo(Yx/3 X lo'*M0keV)"£(z)"^/^ (5.78 ± 0.30) x lo'" Z;'/^ Mq 0.6,fixed 



Note. — To apply the relations, measure the mass proxy for your h of choice, and scale the normalization factor in column (2) 
according to the /i-dependence siven in column (3). Mtot - Yx relation should be a pplied according to eq.jl4|. The fg trend is used 
in the Aftot - Afgas, ^q.js) and JtI; the z-dependence of this relation is discussed in §14.2. 21 



IGottlober et£r|2 001; C ohn & White|2005| l, as we indeed ob- 
serve in our sample (Fig.jsJT 



Second, KVN and Nagai et al. ( 2007b I classified the simu- 
lated clusters as relaxed and unrelaxed using only the morphol- 
ogy of their mock X-ray images. We, therefore, can apply the 
equivalent classification to the observed clusters in our high- 
and low-z samples. "Unrelaxed" clusters are those with sec- 
ondary maxima, filament ary X-ray structure s, or significant 
isophotal centroid shifts. Nagai et al. f2007b') show examples 
of this classification applied to simulated data; more examples 
in the real data can be found in Fig.[3]-|4] The 73(: -based mass 
estimates for clusters identified as mergers should be corrected 
upwards by a factor of 1.17. Formal uncertainties on this cor- 
rection factor are ±0.05 (Table 2 in KVN); the average uncer- 
tainties for the entire population are futher reduced because 
masses have to be corrected only for a fraction of clusters (see 
below). Applying such a correction for individual objects au- 
tomatically takes into account any redshift-dependent changes 
in the fraction of mergers, and thus r emoves this source of de- 
partures from self-similar evolution. Ventimiglia et al. 12008 1 
show that the deviations from the mean A^tot - '^x relation 
in the simulated clusters are correlated with the quantitative 
substructure measures; using such an approach instead of our 
simple classification is potentially more accurate and would be 
warranted in samples of larger size. 

We note that the underlying source of difference in the 
Aifot - Tx normaUzation between mergers and relaxed clusters 
is incomplete relaxation of the intracluster gas. A fraction of 
energy is contained in bulk motions of the gas and it is gradu- 
ally converted into heat as the cluster relaxes after a merger. 
This process (unUke, e.g., radiative cooling in the center) 
should be reliably reproduced by current simulations, and so 
our rehance on the simulations to derive this correction is jus- 
tified. Our dichotomical classification is of course very approx- 
imate and a more accurate approach should take into account 
the cluster relaxation history. However, the current X-ray data 
does not allow us to quantify the cluster dynamical state with 
the require d precision. Even r ather simple substructure mea- 
sures (e.g., J eltema et al.|2007 | require more photons than we 
have for distant objects. Moreover, the process of relaxation 
should be sensitive to effective viscosity in the ICM and it is 
unclear that the current simulations which incorporate only 
low, numerical viscosity can accurately predict the Mjot - Tx 
relation for semi-relaxed clusters. They, however, should still 
be rehable for the extreme cases. In nearly- relaxed clusters, the 
turbulent motions are very weak in the inner regions (which 
dominate the Tx measurements) even in the zero-viscosity 
simulations. In post-merger clusters, most of the turbulent en- 
ergy is in the large-scale flows which dissipate on long time 
scales even if viscosity is high (Coulomb). Furthermore, the 
magnitude of this correction is relatively small. We estimate 



that the fraction of non-relaxed clusters in the sample changes 
from 35% at z = to ~ 80% at z = 0.6 (Fig.[8|. The correspond- 
ing correction for the Mtot - Tx normalization for the entire 
populations is +6% and +13% at z = Oandz = 0.6, respectively; 
thus the redshift-dependent correction is only 7% in mass. 

4. 1.4. Summary of Mass Estimates through Mtot - Tx Relation 

This section provides a summary of how we use the X-ray 
temperature for the cluster Mtot estimates. 

First, an initial value of Tx is obtained from the Chandra 
spectrum integrated within a wide aperture (not excluding the 
center). This Tx is used to estimate M500 using the power law 
fit ([6| and thus rsoo- The temperature is then remeasured in the 
annulus (0.15 - 1) rsoo and this procedure is iterated several 
times until convergence is reached. If the radius rsoo is well 
outside the Chandra field of view, we use a smaller apert ure, 
(0.15 - 0.5) rsoo, and apply corrections detailed in § 4.1.1 



Our Mtot - Tx relation is calibrated by very high-quaUty 
Chandra observations of 17 low-redshift relaxed clusters with 
a wide range of masses. The statistical accuracy of this calibra- 
tion is Ri 3%, so the dominant source of uncertainty is system- 
atics, mostly related to the possible presence of non-thermal 
pressure components in the ICM. 

Next, we need to compensate for the expected systematic 
difference in the Mtot - Tx relation for relaxed and unrelaxed 
clusters. If the X-ray morphology shows that the cluster is 
unrelaxed, the mass estimated fro m th e Mtot - Tx relation is 



. In doing so, we as- 
and unrelaxed clusters 



multiplied by a factor of 1.17 (§ 
sume that Mtot - Tx relations for relaxes 
separately evolve precisely as expected in the self-similar the- 
ory (M for fixed T scales as £(z)"'). We cannot verify this 
assumption independently of the background cosmology we 
would like to measure. Instead, we rely on the simulations to 
estimate the magnitude of possible departures from the self- 
similar scaUng. Such departures must be treated as systematic 
errors which affe ct the cosmological c onstraints. From the re- 
sults presented in Nagai et al. 12007b), we estimate this uncer- 
tainty to be equivalent to Ri 7% difference in the normalization 
of Mtot - Tx relations at z = and z = 0.6. 

To properly compute the Ukelihood function for the esti- 
mated cluster mass functions, we need to know the intrinsic 
scatter in the T^-based mass estimates. The simulations sug- 
gest that this scatter is ~ 20%, and we adopt this value. We later 
verified that reasonable variations of the scatter (in the range 
15 - 25%) have negUgible effect on fitting the estimated mass 
function. This range brackets the scatter observed in the sim- 
ulations separately for relaxed and unrelaxed subpopulations, 
as well as for low and high-redshift clusters (Table 2 in KVN| l. 
Therefore, our analysis is insensitive to reaUstic trends of the 
scatter with redshift. 
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4.2. M(, 



Mgas Relation 



Our second method of estimating the duster total mass uses 
the X-ray derived hot gas mass as a proxy. The apphcation of 
this proxy is extremely simple in an ideal case in which all clus- 
ter baryons are in the ICM, the ICM strictly follows the dis- 
tribution of dark matter, and clusters contain e xactly the cos- 
mic mix of bary onic and non-baryonic matter ( [Voevodkin & 
Vikhlinin 2004|. The total mass in this case is given simply by 



gas> 



(7) 



where Mgas is provided by the X-ray data, and fg, to the first 
approximation, equals D.i,/D.m, the ratio which is accurately 
given by the CMB measurements. To estimate the mass corre- 
sponding to a given critical overdensity, we need to solve 



4/5nr^ Pc{z) 



(8) 



for r to find the corresponding overdensity radius, ta. Equa- 
tion l|7j with Mgas evaluated at is then used to find Mtot. 

4.2.1. Corrections for Non-Universality of Gas Fraction 

In reality, the Mgas -based estimate is more complicated be- 
cause the observed gas fraction in clusters is significantly lower 
than the cosmic average (e.g., Ettori"2003 Allen et al. 112004 



LaRoque et al. 2006 Afshordi et al. 2007) and moreover , there 



are tre nds of observ ed fg with the cluster mass (e.g., Mohrl 



et al.|1999 , ,V06:_ Zhang et al.|2006 l. This trend can be relate" 



to the baryon cooling and galaxy formation ( Kravtso v et al. 
2005 1, energy feedback from the central AGNs ( Bode_et aT 



2007 1, evaporation of supra-thermal protons ( Loeb 2007]l etc. 



— processes whose theoretical modeUng is highly uncertain 
at present. The best approach is therefore to derive the trend 
fg{M) observationally. Once this is done, it can be straight- 
forwardly taken into account in eq. Js) — we just need to use 
fg{i/3 n pc) instead of a constant. 

The main problem is that direct X-ray hydrostatic Mtot mea- 
surements near r^oo are feasible only in a small number of clus- 
ters, insufficient to estabUsh the functional form of the fg{M) 
trend . We c an, however, foUow the approach used in Moh£ 
|et al.| ( |1999| l — the total mass (and hence, rspp) can be es- 



4.1 



above), and 



timated from the average temperature (see § 
then the gas mass determined from the X-ray image within 
that radius. Such estimates of fg have substantial uncertain- 
ties because of the scatter in the Mtot - Tx relation, but this 
method can be appUed virtually to any cluster. The results for 
our low-z sample are shown by grey points in Fig.[9] The his- 
togram shows the averages of these crude estimates in several 
mass intervals. Clearly, the data suggest an approximately Un- 
ear trend of fg with log M. The fg values obtained from hydro- 
static mass measurements closely follow the same trend (sohd 
black points in Fig.[9|. These, more accurate, values are used 
to determine the normaUzation and slope of the fg{M) trend, 

fg (h/0.72)^-^ = 0.125 + 0.037 logMjg, (9) 

where M15 is the cluster total mass, M500, in units of 
10^^ h^^ M©. Extrapolation of this trend to lower masses de- 

We assume that the cluster mass is the only parameter controlling sys- 
tematic trends in f^lhis assumption is consistent with current observations 
(see caption to Figpl. If there are additional parameters, their role would be 
to introduce systematic scatter in the Mgas/Mtot ratio for fixed Mtot- The ob- 
served scatter is consistent with the value we adapt based on the simulations. 
If the scatter can be related to easily measured X-ray observables, it would be 
possible to improve the quality of the Mgas proxy still futher. 
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Fig. 9. — Trend of fg within r = rsoo with cluster mass derived from X-ray 
observations. The solid black circles show the results from direct hydrostatic 
mass measurements I V06 with 7 additional clusters, see §|4). Grey circles show 
approximate estimates using the Mtot - Tx correlation (see text). The scatter 
is consistent with being purely due to mass measurement uncertainties, either 
from h ydrostatic estimates I Nagai et al. 2007b I or from Mtot - Tjc correlation 
jKVN) . The error bars indicate only the tormal measurement uncertainties. 



scribed the observed fg for galaxy groups (Sunetal.|2008l. The 

t„ is d 



uncertainties of the coefficients are such that fg is determined 
to 4 - 5% across the useful mass range, 10 - 10^^ h'^ Mq, 
resulting in the same systematic uncertainty in the Mtot esti- 
mates because of the fg{M) trend. The systematic uncertain- 
ties are, however, domi nated b y t hose o f the hydrostatic mass 
estimates (discussed in § |4.1.2| and |4.3.1| l and so the overall cal- 
ibration of the absolute mass scale with the Mgas method is the 
same as that in the Mtot - Tx 01 M - Yx relations. 

The observed fg within for the highest-mass clusters is ~ 
25% lo wer than t he cosmic baryon fraction, Cly/ D.m = 0. 165 ± 
0.005 (Komatsuet al. 2008 1. Partly, the remaing baryons can be 
in the form of stars. The observed star-to-gas ratios for massive 
clusters are in the range of 0.05-0.1 (Gonzalez et al. 20071 but 
the stellar masses are derived from population synthesis mod- 
els and thus can uncertain by factors of order 2. The tension is 
reduced still futher if the Hubble consta nt value is lower than 
we assume. For example, for h = 0.685 ( Komatsu et al.|2008 
the lower 1 a bound for the combined constraints in), the X-ray 
derived/^ values are 8% higher than we quote in eq.lj9jl. 

4.2.2. Evolutionary Corrections 

Unfortunately, we cannot observationally establish the 
fg{M) trend for high-z clusters independent of the underly- 
ing cosmology. Therefore, we have to rely on the theoretical 
models that explain the observed trend at z = and can pre- 
dict its evolution at least for the cosmologies close to the "con- 
cordance" ACDM. Since, unfortunately, no completely satis- 
factory model currently exists, this step is a major source of 
systematic uncertainties. 

One such model can be based on the numerical simulations 
presented in Kravtsov et al. (20051. The simulated clusters 
show the trend in fg{M ) which is very close to that observed 
at z = 0, both in terms of slope and magnitude of the devia- 
tion from the global baryon fractio n, 0^/0 m (Fig.fiolfl). The 
"missing" baryon component in the Kravtsov et al.^ clusters is 
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Fig. 10. — The dependence of fg within rsoo on the cluster mass observed 
in high-resolution cosmological simulations with cooling, star formation, and 
feedback I Kravtsov et al. 2005 Nagai et al. 2007b I. The simulated clusters show 
qualitatively the same trend as that observed at z = (Fig.|9l, but there is a 
clear evolution of fg for the given M. The z-dependence is almost completely 
removed if we scale the cluster masses by the characteristic non-linear mass 
scale, M» (lower panel). 



converted into stellar material, so that (Mstars + AfgaJ/Mjot 
is within 10% of the value Clh/Clm specified in the simulation. 
Observational support for this model is provided by the recent 
work of Gonzalez et al. ( 2007 1 who show that the trend in stel- 
lar mass fraction, /stars(Af), is such that it roughly compen- 
sates for decreasing fg in low-mass clusters, making /stars + fg 
nearly constant at M500 ^ lO''' h^^M© clusters, although not 
quite bringing it t o the W MAP value of D.h/O.M (see discus- 
sion at the end of §|4.2.1|l. 

The trend in the Kravtsov et al. simulations show a clear de- 
pendence on the redshift (Fig. 



10 



a) in the sense that a given 
value of fg corresponds to a systematically decreasing M500, 
although at each z, fg{M) seems to foUow Unes with the same 
slope. Empirically, we find that the dependence on the red- 



shift is almost completely removed (Fig.[To]fo), if we scale the 
cluster masses by M*, the mass scale corresponding to a Un- 
ear fluctuations amplitude of 1.686: (t(M*) = 1.686, where 
(t(M) is the rms fluctuation of density field smoothed with a 
top hat filter containing mass M. In other words, the simula- 
tions indicate that fg{M/M^) is almost independent of red- 
shift, at least at z < 1. A simple explanation of such a scaling 
can be related to the mass distribution of the cluster progeni- 
tors at high redshifts'^. We also note that quaUtatively similar 
scaling (fg for a fixed mass increases at high z) is expected if 
the gas distribution in a cluster potential well does not evolve at 
all (e.g., inner regions of a cluster remain in equilibrium and 
do not evolve significantly) and gas fraction is constant, but 
mass M500 changes simply due to evolution of the background 
critical density, pc (z), to which it is tied by the definition. 

In the spirit of our general approach of using the theoreti- 
cal results in the cluster mass estimates as minimally as pos- 
sible, we use the observed dependence of fg on mass for low- 
redshift clusters (eq.|[9|), and take a suggestion from simula- 
tion that the same trend should hold at all redshifts, if masses 
are scaled by M» computed for the given cosmological model. 
This gives us fg{M, z), necessary to estimate the total cluster 
mass from the observed Mgas (eq.jsj). Although this adopted 
z-dependence is motivated only qualitatively, the predicted 
overall correction is small. For example, for the cosmologi- 
cal model with Om ~ 0.28, ffg « 0.78 (close to the best-fit to 
our cluster data), M» = 3.6 x lO'^M© and 8.1 x lo" M© at 
z = 0.05 and 0.5, respectively. The median masses of clusters 
in our sample are 4.8 x lO^'' M© at z = 0.05 and 2.3 x 10^* M© 
(see below). The ratio Msoo/M*, therefore, varies from ~ 130 
to ~ 280, corresponding to a predicted change in fg for the me- 
dian mass clusters of 11% (eq.|9]). A reasonable estimate for 
the systematic error is around 50% of this overall correction, 
or 5-6% in terms of mass between redshifts of and 0.5. 



4.2.3. Summary for Mtc 



Mgas Relation 



To summarize, our approach to the Mgas -based estimates of 
the total cluster mass is based on using eq.||8| to find rsoo, and 
hence M500, for each cluster. In this equation, Mgas(r) is the 
observed gas mass profile derived from the X-ray image, and 
fg is the estimated gas fraction as a function of mass and red- 
shift^^. The de pendence fg{M) is determined empirically at 
z Ri (§ |4.2.1[ eq.|[9|). It is assumed that this trend evolves 
with redshift such mat fg remai ns con stant for clusters with a 



4.2.21 

this Mtot estimate are dom- 



fixed M/Mt (this is justified in § ' 
The systematic uncertainties of 
inated by those of fg. The latter can be factorized into two 
components, the uncertainties of the empirical measurements 
at z « 0, and the uncertainties of the assumed evolution with 
redshift. The low- redshift uncertainties are essentially those of 
the X-ray total mass estimates, discussed above in connection 

Most of the stellar mass in the red and old galaxies of observed clus- 
ters, which contain the bulk of the cluster stellar mass, and in cosmologi- 
cal simulations is in place by z ~ 1 - 2, so the stellar fraction within each 
lower-redshift cluster is effectively "pre set" at high redshifts. Efficiency of 
star formation within each galaxy-sized dark matter halo depends on the halo 
mass. Therefore, the stellar fraction within clusters is probably defined by the 
mass function of its progenitors near the redshift of the peak star formation 
(z = 2-3). Indeed, the calculations of the progenitor mass functions using ex- 
tended Press-Schechter theory I Lacey & Cole 1993 1 show that they are much 
more similar for clusters with the same Al 500 / Al* than for those with the same 
M500 at different z. 

FormaUy, we can write the gas fraction to be a function of radius, as 
fg = fg(M/Mt) = /g(500 pc(z) 4/3 7rrVM*(z)), and then js] becomes an 
implicit equation, which can be solved for rsoo numerically. 
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with the Mtot - Tx relation. More important for cosmological 
constraints is the redshift-dependent uncert ainty. W ithin our 
redshift range, it can be estimated as 5-6% (§ 4.2.2 1. 

The object-to-object scatter in the Mgas -based total mass es- 
timates can be easily derived from the analysis of mock X-ray 
data for simulat ed clust ers. This was done in iK ravtsov et al.l 
([20061 and Nag ai et al.| ([2007bl, who find that the scatter in 
tne Mtot - Mgas relation is approximately 11% in Mtot for a 
given Mgas. Most of this scatter results from the X-ray analy- 
sis, as intrinsic scatter of the gas mass for a fixed total mass in 
simulated clusters is < 5%. 

4.3. Mtot - Yx Relation 

The final Mtot prox y we us e is the most robust X-ray mass 
estimator proposed by KVN The quantity, Yx, is defined as 

Yx=TxX Mgas.X, (10) 

where Tx is the temperature derived from fitting the clus- 
ter X-ray spectrum integrated within the projected radii 
0.15 rsoo - 1 ^500, and Mgas.x is the hot gas mass within the 
sphere rsoo, derived from the X-ray image. 

The quantity that Yx approximates is the total thermal 
energy of the ICM within rsoo, and also the integrated 
low-f requency Sunyaev-Zeldovich flux (S unyaev & Zeldovich^ 



1972[ ). The total thermal energy, 7, was found in the simula- 
tions to be a ver y good indicator of the total cluster mass (fdaj 



Silva et al.l2004||Motl et aH2005| [Hallman et all2 006 Nagai^ 
20061. In tlie simplest self- similar model l |Kais'er[r986||1991f T' 



Y scales with the cluster mass as 



Mt< 



^3/5 



E{z) 



-2/5 



(11) 



(e.g., KVN| l. This scaling is a consequence of the expected 
evolution in the Mtot - T relation (eq.Q) and the assump- 
tion of the self-similar model that is independent of clus- 
ter mass. Hydrodynamic simulations show that the expected 
scaUng [TT| is indeed vahd, and moreover, the relation shows a 
smaller scatter in M for fixed Y than, e.g., the M -Tx relation. 
The primary reason is that the total thermal energy of the ICM 
is not strongly disturbed by cluster mergers (Poole et al.|2007|l, 
unhke Tx or X-ray luminosity (Ricker & Sarazin 2001 1. 

It is reassuring that the Mtot ~ Y scaling also appears to be 
not very sensitive to the effects of gas cooling, star formation, 
and energy feedback ( Nagai 2006^ — these effects do not affect 
the power slope or the evoliation law, although change some- 
what the overall normalization. The stabiUty of Y is primarily 
explained by the fact that gas cooling tends to remove from 
the ICM the lowest-entropy gas ( Voit & Bryan 2001 ), increas- 
ing the average temperature of the remaining gas and thus af- 
fecting Tx and Mgas in opposite w ays. Direct hydrodynamic 
simulations of Nagai et al.'('2007a) confirm this expectation. 

As discussed in KVN, the X-ray proxy, Yx, is potentially 
even more stable with respect to cluster mergers than the "true" 
Y. In the post-merger state, for example, the temperature and 
thus Y is biased somewhat low because of incomplete dissipa- 
tion of bulk ICM motions. The same bulk motions, however, 
cause the gas density fluctuations, which lead s to an overesti - 
mation of Mgas from the X-ray analysis ( Mathiesen et al.T999|. 
Therefore, the merger- induced deviations of the average tem- 
perature and derived Mgas are anti-correlated and hence par- 
tially canceled out in Yx- Even t he stro ngest mergers in the 
simulated cluster sample used in KVN| do not lead to large 
deviations of Yx from the mean scaling. There is also no de- 
tectable systematic offset in the normalization of the Mtot - Yx 
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Fig. 11. — Calibration of the Mtot - Yx relation. Points with errorbars show 
Chandra results from Vikhlinin et al. 120061 with 7 additional clusters (§^. 
Dashed line shows a power law fit (excluding the lowest-mass cluster) with 
the free slope. Dotted line shows the fit with the slope fixed at the self-similar 
value, 3/5 (parameters for both cases are given in TablejsJ. Open points show 
weak lensing measurements from Hoekstra 1 2007 1 (these data are not used in 
the fit); the strongest outlier is A168y (open star), a known case of large scale 
structures superposed along the line of sight. 

relations for relaxed and unrelaxed clusters. The upper limit 
for the difference in Mtot for fixed Yx within the |KVN| simu- 
lated sample is 4% (see their Table 2). 

Since Yx is so insensitive to the cluster dynamical state, it 
is straightforward to calibrate the Mtot - Yx relation using the 



sample of Chandra clusters from V06 and then it is reason- 
able to assume that the same relation is also vahd for unre- 
laxed clusters. The observed Mtot - Yx relation does foUow 
very clo sely the expected se lf-similar scaling of eq.pT|(Fig.[TT] 
see also Arnaud et al.|2007l l. The best-fit power law is 

MEizY" oc 7^-53±o.o4 (^2) 

when all clusters are included. The marginal deviation of the 
slope from a self-similar value of 3/5 is driven primarily by 
the lowest-temperature cluster (MKW4), for which both the 
total mass and Yx measurements are most uncertain. Exclud- 
ing this cluster (its Mtot is in any case smaller than the lower 
mass threshold in the cluster mass functions in our samples), 
the power law fit becomes 



ME{zy'^ oc y. 



(13) 



fully consistent with the self-similar relation (shown by a 
dashed line in Fig.[TT|. We use the latter fit for the Yx-based 
cluster mass estimates. Note that jSun et a l. (20081 find a slope 
of 0.5 7 when they fit jointly their galaxy group sample with the 
|V06| clusters, supporting the notion that the MKW4 measure- 
ment can be ignored. The normalization constant is provided 
in T able[3](it is consistent with the XMM-Newton results of |Ar-| 
|naud et al.|2 007 l. Note that the ft -dependence of the normal- 
ization constant in the Mtot - Yx relation is oc h^/^, different 
from the usual h^^ in, e.g., the Mtot ~ Tx relation. This is the 
consequence of the ft -dependence of the X-ray Mtot and Mgas 
estimates, see |KVN| for details. 

The overall uncertainties of the calibration of the Mtot - Yx 
are identical to those for the Mtot - Tx relation (see §|4.1|l, with 



16 



VIKHLININ ET AL. 



the exception that we do not expect an additional source of 
uncertainty related to the transfer of calibration from relaxed 
clusters to the entire population. As for the M^ot - Tx rela- 
tion, we also have to rely on the simulations for an estimate of 
redshift-dependent de parture s from the expected self-similar 
scaUng. The results of |KVN| provide an upper Umit of < 5% 
for the evolution of the amplitude of the relation at a fixed Yx 
between z = and 0.6. The expect ed leve l of scatter in the 
Mxoi - Yx relation (7% in Mtot, see |KVN| l is below the un- 
certainties of Chandra hydrostatic mass estimates for individ- 
ual clusters. Indeed, the intrinsic scatter is undetectable in the 
data. Since the scatter is expected to be small, its exact value is 
unimportant for modeling the mass function, and thus can be 
safely adopted from the simulations. 

4.3. 1 . Systematic Error of Chandra Mass Measurements 

Using the Mtot - ^x relation, we can address the question 
of absolute calibration of the Chandra mass estimates through 
comparison with recent weak lensing mass measurements in 
representative samples of clusters. Weak lensing measure- 
ments of M500 in individual objects still have ~ 30% uncertain- 
ties, and are expected to have a similar intrinsic unce rtainty 
due to projection of structures along the line of sight ( Metzler] 
etal.|20 01 ). However, as the current weak lensing samples start 
to include more than 10 objects, the average normalization of 
Mtot vs. proxy relations can be measured to better than 10% . 
The two useful recent studies are those of |Hoekstra| l |2007"l > 



and Zhang at al. 120081. In Fig. 11 we compare th e Chandra 



Mu 



- Yx relation with that for i ow-z clusters in the |Hoekstra| 



20071 sample. The Yx for all Hoekstra clusters in this plot were 
erived from Chandra data using the procedure applied to our 
cosmological samples. With the exception of a single outlier 
(A1689, a known case of of large scale struc tures superposed 
along the line of sight, e.g., Lokas et al. 2006| l, the weak lensing 
masses for given Yx are in good agreement with the Chandra 
values. Fitting the ratio of normaUzations of the Mtot - re- 
lations obtained from bisector fits to the two dataset with the 
slope fixed at 0.57, we find m^^'Va^^^'"'"*"^ = 1-01 ± 0.11. 
A similar ag reement is found for the weak lensing masses in 
Zhang et al.| ( |200 8). The normalizations relevant for our case 
are presented in their Table 3. After correcting their Yx by -t-7% 
to compensate for a systematic difference currently observed 
between Chandra vs. XMM-Newton temperatures , we find 
m(«') /^(c/ia«dra) = 1 05 ± 0.07. The weighted average for the 
two samples is M^^^V-M^^'"""*"^ = 1.04 ±0.06. The integrated 
probability within the M^^'V'M^'^'"'"'*™^ = [0.91 - 1.09] inter- 
val is 0.7, thus ±9% is a good estimate for la systematic uncer- 
tainties in the Chandra cluster mass scale calibration. 

4.3.2. Application of the Mtot - Yx Relation for Real Data 

In application of the Yx -based mass estimates to the real 
data, we face a practical problem that Yx should be determined 
within rsoo, which is itself unknown. Moreover, Yx{r) di- 
verges at r ^- 00, although less quickly than Mgas(r). The total 
mass should thus be estimated with the approach similar to the 
Mgas-basedmethod(eq.|j8]) — we find rsoo and hence M500 by 

Here, we are interested only in bringing all the measurements to the 
Chandra temperature scale because we use Chandra data. It may well be that 
the XMM temperatures are in fact correct. The temperature calibration un- 
certainties should be treated as an additional source of systematic errors (see 
§|8.1.1| for more details). Fortunately, the estimated uncertainties are within 
the bounds suggested by comparison of the Chandra and weak lensing mass 
measurements. 



solving the following implicit equation 

C{TxM^Ur)rE{z)-''' = 500 x 4/3 rrr^ (14) 

where C and a are the parameters of the power law approxi- 
mation to the Mtot - Yx relation, Mtot = CY^ E{zy^/. 

5. SURVEY VOLUMES 

We now need to turn to the next critical component of the 
cluster mass function derivation — determination of the ef- 
fective survey volume. Our cluster samples are derived from 
essentially purely X-ray flux limited surveys. We can then 
straightforwardly compute the sample volumes as a function 
of X-ray luminosity, 

dV 



dV 

ViL)= / A{f,,z)—dz, (15) 
Jzi dz 



where / is the X-ray flux corresponding to the object with lu- 
minosity Lx at redshift z,dV/dz is the cosmological volume- 
redshift relation, and A(/^,z) is the effective survey area for 
such objects. A relation between cluster luminosity and flux, 

/= ,, K{z), (16) 

4ndi{z)-^ 

depends on the cosmological background through the bolo- 
metric distance d ijz) and the JC-correction factor (see, e.g., 
Jones et al.|1998| specifically for the case of the cluster X-ray 



spectra). Ihe X -correction depends on the assumed cluster 
temperature but this dependence is very weak if both fluxes 
and luminosities are measured in the soft energy band (0.5- 
2 keV as we use here) . In practice, a sufficient level of accuracy 
is achieved by estimating T from the non-ev olving Lx - T rela- 



1998 



tion accurately m easured for low-z clusters (Markevitch 
Tukazawaetal.T998l. 

Because the objects in our low-redshift sample are all well 
above the RASS detection threshold, their survey area, A{f, z), 
is adequately approximated by a constant value, 8.14 sr, eq ual 
to the geometric area of the s ky regions covered (see § |2.2| and 



Reiprich & B6hringer||200 2'l. The situation is more complex 
tor our high-z clusters drawn from the 400d survey. Sky cov- 
erage there is a function of flux because our distant clusters are 
generally not much brighter than the detection thresholds in 
individual ROSAT pointings and because the detection thresh- 
olds also vary widely depending on the exposure time of each 
pointing. Formally, the sky coverage is also a function of red- 
shift because detection efficiency is somewhat sensitive to the 
cluster angular size. A detailed discussion of these effects in 
application to the 400d survey, as well as a careful calibration 
of A{f,z) for the fuU 400d sample was presented in ^Paper I| 
(see their § 7). 

An additional comphcation arises because we use only a 
brighter subsample of the 400d sample at 0.35 < z < 0.473 



(see § 2. 1 and Fig.[TJ . We need, therefore, to recompute A(/, z) 



using eq.[2-3] from Paper I with in their eq.[2] set to 
the actual selection fluxes used in our subsample. This is a 
straightforward calculation but the results cannot be conve- 
niently presented in a paper. We provide machine-readable 
tables for A(/, z) at the 400d survey WWW site^**. 

Stability o f the 400 d survey area calculations was extensively 
discussed in |Paper I| The general conclusion is that the uncer- 
tainties in A{f,z) do not exceed 3%, and therefore they make 
a negligible contribution to our overall error budget. A dom- 
inant source of uncertainty in determining the volume as a 
function of mass is the details of the Lx - M relation. 

' ^ http://hea-www.harvard.edu/400d/CCCP 
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5.1. Lx - Mtot relations 

To fit mass function models to the data, we need to know the 
survey volume as a function of mass, not luminosity. The two 
are trivially related if there is a well-defined relation between 
the cluster mass and its luminosity: 



dV{M) r dV{L) 



I' 



P{L\M,z)dL, 



(17) 



dz Jl dz 
where P{Lx\M, z) is the probability for a cluster with mass M 
to have a luminosity Lx at redshift z. The volume in the given 
redshift interval is obtained by integrating this equation, 

V{M)= f'dz f A{f^,z)^P{L\M,z)dL, (18) 
Jzi Jl dz 
where dV/dz is the cosmological volume-redshift relation and 
A{fx,z) is the survey area coverage (cf eq 15 i. 

The simplest model that seems to adequately describe the 
observed Mtot - Lx relations can be represented as a power 
law with approximately log-normal intrinsic scatter around 
the mean which is independent of mass and redshift, and the 
redshift evolution that changes the normalization but keeps 
constant the slope of the power law, 

P(lnL|M)ocexp p^"^;]f°)' ), (19) 



where 



A(z) M". 



(20) 



The evolution factor is sometimes approximated as a power law 
of ( 1 + z) (the simplest model) and sometimes as a power law 
of £(z) (self- sim ilar evolution inspired models, see e.g., Bryan 
|&Norman|1998| : ' ' 

A{z) =Ao(l + zy or A{z) = AoE{zy (21) 

A recent study by "Maughan (2007 consistent with our results 
below) indicated that the evolution factor, E{zy, is in fact 
close to that expected in the self-similar model for the "con- 
cordant" cosmolo gical model. Howev er, the general consen- 
sus has been (e.g.,|Borgani et al. 2001 1 that we should not rely 



on the simplest theory for the evolution in the Lx - Mtot rela 
tion and instead should determine it empirically for each back- 
ground cosmology. We take this approach in the present study. 



5.1.1. Fitting Procedure and Treatment ofMalmquist Bias 

In the model specified above, the Lx - M relation is char- 
acterized by four parameters, Ag, a, y, and a. These parame- 
ters can be determined using mass estimates for clusters from 
our sample. The large size of our sample even allows us to test 
the basic assumptions of the model, e.g., that the scatter is log- 
normal. A major compUcation, however, is Malmquist bias. In 
a flux limited sample, the average luminosity of selected clus- 
ters is higher than that in the parent population. Th e effect 
becomes strong if the scatter in Lx for fixed M is large ( Stanek 
let al.|2006||Nord et al.|200 7l, as observed. Nord et al. address 
the question in which regimes the bias is unimportant. Stanek 
et al. describe how to estimate the magnitude of the Malmquist 
bias by simulating mock catalogs. Stanek et al. also discuss how 
to derive the mean Lx-M relation and scatter from the cluster 
X-ray luminosity function if the cosmological model (includ- 
ing ffg) is assume d known. A similar approach was used by 
Ikebe et al. ( 2002| l who consistently modeled the L-T relation 
together wilJi the cosmological fit to the cluster temperature 



function. However, as we show in Appendix|A] estimation of 
the Malmquist bias can be separated from the cosmological fit 
to the mass function, which leads to simpler algorithms than 
those used by Stanek et al. and Ikebe et al. An approach similar 
to ours was independently developed by Pacaud et al. (20071. 

Our algorithm is fully described in Appendix[A| Here, we 
briefly outline the main results and modeling steps and then 
proceed to presenting the results. A typical situation in low- 
z surveys with a high flux Umit is that the search volume is a 
power law function of the object luminosity (e.g., V oc L^l^ 
in Euclidean space and no low-z cutoff), and that the evo- 
lution can be neglected wit hin th e survey's effective redshift 
depth. In this case (Appendix |A.l| , the Malmquist bias leads to 
a constant offset in the normalization of the observed Lx-M 
relation; the observed rms scatter in InL for fixed M equals 
the standard deviation of the log-normal distribution {a in 
eq. [T9] ) . The true relation is therefore very simple to recover 
for typical low-z surveys. For the observed scatt er, a = 0.39 



(see below), the bias is A InL 3/2 = 0.23 (eq.||A4]), or Lx 
for fixed M is overestimated b y ^ 26% (much sma ller than the 
factor of ~ 2 bias advo cated by|Stanek et al.|2006| but cosistent 
with the Umit from Reiprich'(2006). This bias is independent 
of the actual flux limit of the low-z survey. If uncorrected for, it 
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mass by « 40% 



leads to overestimation of the volume fqi 

3/2 

(because V{M) oc L^ at low z, see eq. 

The treatment of the Malmquist bias is more comphcated 
if the evolution in the Lx-M relation cannot be neglected. 
However, in this case it is still possible to derive a likelihood 
function which can be computed numerically given the sur- 
vey selection functions and which impUcitly depends on the 
par ameters of the evolving Lx-M relation, Aq, a, y, and a 
(eq.| AlO - All| ). One can also compute th e ave rage bias for 
each cluster given Aq, a, y, and a (Appendix |A.2| ; using these 
corrections we can easily check if the basic model assumptions 
(e.g., that the scatter is log-normal and independent of both M 
and z) are sufficiently accurate. The tests of our fitting proce- 
dure using the mock catalogs show that it recovers the true 
parameters of the Lx-M relation without significant biases. 

5.1.2. Results for Lx - My relation 

We independently derive the Lx-M relation for each of 
our mass proxies. In this section, we summarize the results 
obtained with the Yx proxy (hence the relation is called Lx - 
My); the results for the Tx and Mgas proxies are very similar 
The best fit (obtained with our maximum likelihood method) 
to the evolving relation in the form eq. (20|]2T) is 



Inix = (47.392 ±0.085) + (1.61 ± 0.14) InMjoo 
+ (1.850 ± 0.42) ln£(z) - 0.39 ln(ft/0.72) 
± (0.396 ±0.039) 



(22) 



where the last term on the right hand side indicates the ob- 
served scatter in Lx for fixed M. The uncertainties for each 
parameter are obtain ed fro m the mock catalog simulations de- 
scribed in Appendix A.3 For the median mass in our sam- 
ple, the best-fit normalization agrees very well with that from 
(Reiprich & Bohringer 2002, their Table 10, after converting 
the luminosities to the 0.5-2 keV band), even though we do 
expect some difference due to correctio ns fo r the Malmquist 
bias applied in our analysis (Appendix |A.1| . In this regard 
we note that our more elaborate procedure for hydrostatic 
mass estimates should lead to sistematicallty different results 
than a simple isothermal -model analysis used in Reiprich &| 
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Fig. 12. — Results for the mass-luminosity relation with Mtot estimated from 7j 


. Left panel shows the correlation for fow-redshift clusters (black points) with the 


5£st-tit power law relation. Ihe red points show the data tor high-z dusters with the luminosities corrected tor the evolution [£(z)' J. All luminosities are corrected 


^r the expected Malmqiust bias (see Appendix|A.2|. Right: Evolution m the normalization ot the - M relation. Individual measurements have been corrected 


for Malmquist bias and divided by the best-ht low-z relation. Solid and dotted lines show the best ht in the lorm £(z)' and ( 1 + z)' , respectively. In both panels^ 


tEe clusters with large correction (A InL > 0.5) are shown with open symbols. Ihe lack ot a systematic ottset between clusters with the estimated strong and wea^ 


Malmquist bias proves that the correction has been applied correctly. Ihe z > 1 clusters in this panel are trom the RUCS survey 1 lozzi et al. 2UU3 1; they were noT 


used m the ht and are shown only to demonstrate that the extrapolation ol our best-ht J; (z) 


> evolution to higher redshrtts still produces reasonable results. 






Fig. 13. — Distribution of the deviations from the mean Lx - M relation 
for the low-z sample (where the contribution of measurement uncertainties 
is negligible). Solid line shows the best-fit log-normal distribution with the 
scatter (T]„ = 0.396. 



Bohringer ( 2002|l ; the net effect of updated Mtot measurements 
and corrections lor the Malmquist bias appears to lead to very 
small revisions of the normalization of the Lx - Mtot relation. 

The left panel in Fig. 12 shows that indeed, the low-z data are 
adequately described by a single power law relation. The high- 
z clusters also follow the same relation with approximately the 
same scatter, after correction for the evolution in the overall 
normalization [i;(z)^'^^]. The observed evolution in the nor- 
malization (right panel of Fig. 12 i is consistent with the E{zy 



evolution law is not crucial for our purposes since we use the 
Lx - M relation only to estimate the survey coverage at each 
redshift and not to estimate the cluster masses. The effect of 
the choice of the parametrization on the derived V{M) is dis- 
cussed below. 

The observed deviations from the mean relation at low red- 
shifts (Fig.[l3} are consistent with the log-normal distribution 
with a scatter of crit,^^ = 0.396 (or « ±48%) in Lx for fixed 
M. The contribution of the measurement uncertainties to this 
scatter is negligible for low-z objects. The expected scatter in 
the Mtot estimates using Yx is also significantly lower. There- 
fore, it is reasonable to expect that the observed scatter is a 
good representation of that in the relation between Lx and true 
mass''. The current data quaUty is insufficient to characterize 
the shape of the scatter distribution precisely. For example, we 
cannot check if the tails of the distribution are consistent with 
the log-normal model. The knowledge of tails in the P{Lx\M) 
distribution is cru cial if one uses Lx as a proxy for cluster mass 
| |Lima & Hu|2005, |. In our case, however, the Lx -Af relation is 
used only for the survey volume calculations, where the effects 

below). 

M relation imphes that 



5.1.3 



scahng, but also with a (1 + zy law. The exact form of the 



of the P{Lx\M) are minor (see § 

The observed 48% scatter in theTx 
Malmquist bias effects are very significant. For example, in a 
purely flux-limited low-z sample, the average bias in the lu- 
minosity for fixed M is A InL = 0.235 or 26% (see eq.| A4] in 
Appendix|A.ip. This is quahtatively similar to the conclusions 
of Stanek et al. ( 2006 ), although our predicted bias is lower be- 
cause Stanek et al. have assumed a larger scatter in the Lx - M 
relation than that observed in our data. 

5.1.3. Results for V{M) 

" Note that we are forced to use th e tot al luminosities, including centers 
and substructures, for reasons given in § |3.2| If these components are excluded 
from the flux m easurements, the scatter can easily be made lower, see, e.g., 
|Maughan|j2007) 
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Fig. 14. — Volumes (comoving) as a Function of cluster mass for our low red- 
shift sample (z = 0.025 - 0.25) and three high-redshift subsamples of the 400d 
survey. 

With the model for the Lx - M relation at hand, we can 
now compute the search volumes as a function of cluster mass 
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15 1. The results for our local sample and the three red- 

The volume 
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shifTBins in the 400d sample are shown in Fig. 
for the local sample follows a power law function of M in a 
broad range of masses, as expected for a flux-Umited sample. 
A sharp decUne of the volume atM<1.5xlO^'' M© is due to 
a combination of the flux threshold and a lower redshift cut- 
off of the sample (z > 0.025). The sample becomes volume- 
Umited for high masses because we imposed an upper cutoff 
(z < 0.25) in the volume calculation. For the three high-z sub- 
samples shown in Fig. 14 the dynamic range in z is smaller and 
the transition from the volume-limited to strongly incomplete 
regimes is much sharper 

We should now discuss how sensitive the survey volume 
computation is to the assumptions in the Lx - M relation 
model. The largest uncertainty in the volume computation is 
related to the measurement errors of the luminosity scale for 
fixed M. The effect is strongest for the high-z data because the 
normalization of the Lx-M relation is derived using a smaller 
number of clusters, with larger measurement uncertainties, 
and spanning a range of redshifts (see Fig.[T2]l. Overall, the 
uncertainty in the high-z relation c orresp onds to ±10.5% in 
the Lx scale at z = 0.55 (Appendix ] A. 3|l . This is equivalent 
to varying y by ±0.33 assuming that the low-z normalization 
is fixed; note that the range Ay = ±0.33 is smaller than that 
quoted in eq.(|22} because the latter also includes uncertain- 
ties in the low-redshift normalization. The long-dashed line in 
Fig. 15 shows how the volume calculation for our high redshift 
sample, 0.35 < z < 0.9, is affected by changing y by +0.33. Re- 
assuringly, the relative change of sample volume is large only 
for low-mass clusters where it becomes comparable to the Po is- 
son uncertainty of the derived mass function (see in § 8.2. 1 be- 
low). 

By comparison, the sensitivity of V{M) to a particular 
choice of the Lx-M evolution model is relatively minor. 
For example, if we use the (1 + z)^ scaling instead of E{zy 
(eq.| 21 1), the largest difference in the best-fit Lx-M relations 
is near z = 0.5 (right panel in Fig.[T2}. The correspondin g rel a- 
tive change of y(M) atz = 0.45-0.55 (dashedhneinFig. 15 i is 
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Fig. 15. — Sensitivity of the volume calculations to various variations of the 
Lx-M relation model. 
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much smaller than the Poisson uncertainties of the mass func- 
tion in the same redshift bin (see below). Therefore, the uncer- 
tainties related to the parametrization of the Lx-M evolution 
can be neglected for our purposes. 

The effects of the scatter uncertainties on the V{M) com- 
putations are comparably small. Note that the situation is cru- 
cially different if one uses Lx to estimate the cluster masses. 
Consider for example, a case of volume-limited survey. The 
V{M) ftinction is unchanged in this case by variations of cr, 
while the estimate o f the clust er mass function is still very 
strongly affected (see Lima &Hu (2005 1). Variations of a affect 
the V{M) computations m two ways. First, there is a positive 
correlation of V and a because of the scatter term in eq.| 17 
(assuming that V{L) increases with L). However, a also im- 
plicitly enters the determination of the Lx-M normalization 
because we need to correct for Malmquist bias; the larger the 
cr, the lower the Lx for fixed M inferred from the same data 
(e.g.. Appendix [a|, and hence the smaller V{M). We need to 
include both these effects to test properly the effect of the a un- 
certainties on V{M) . This was achieved by fixing the value of 
cr at the boundaries of its measurement uncertainties (±10% of 
the best-fit value, see eq.p2)), refitting all other parameters of 
the Lx-M relation, and computing V{M) for these new fits. 
The results are shown in Fig.fTslby the solid and dotted line for 
the high- and low-z bins, respectively. The variation of volume 
is negligible for the low-z sample, but is more substantial for 
the high-z clusters. It is, however, much smaller than the effect 
of uncertainties in the value of y considered above. Note that 
increasing the scatter reduces the volume, indicating that the 
effect of extra Malmquist bias correction on the Lx-M nor- 
malization outweighs the boost in volume due to an increased 
scattering kernel in eq. (Tt) . 



6. CLUSTER MASS FUNCTIONS IN THE CONCORDANT 
ACDM COSMOLOGY 

With the survey volume in hand, we can finally compute the 
mass functions. Figure 16 shows the mass function in the cu- 



mulative representation computed as 



N{> M) 



M,>M 



(23) 
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Fig. 16. — Cluster mass functions for our low and high-z samples. The masses 
were estimated by the Yx method. The errorbars show the Poisson uncer- 
tainties. Solid lines show the model predictions for the adapted cosmological 
model CIm = 0.3, Ha = 0.7, h = 0.72, with only ag fit to the cluster data 
(see text). The evolution of the mass function is non-negligible within either 
redshift range. To take this into account, themodel number densities for each 
mass were weighted with dV{M)/dz (eq. |l7| ) within the redshift bin. 

Our samples span similar mass at low and high redshifts, 
which is very important for the robustness of the derived cos- 
mological constraints. A strong and highly significant decrease 
in the comoving cluster number density at a fixed mass is ob- 
served between z = and z ^ 0.5, by a factor of 5.0 ± 1.2 at 
M500 = 2.5 X 10'* h^^ Mq. This reflects the growth of cosmic 
structure between these redshifts. Indeed, the observed evolu- 
tion of the cluster mass ftinction is in good agreement for the 
"concordance" cosmological model with the power spectrum 
normalization 
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we use the 



0.74 6 (solid lines in Fig. 
mass function model from Tinker etal.|2008, anaour approach 
to the model fitting is discussed in §W\. Ihe strongest observed 
deviation of the data from the model is a marginal deficit of 
clusters in the distant sample near M500 = 3 x 10^* h^^ M© 
— we observe 4 clusters where 9.5 are expected, a 2cr de- 
viation. The cumulative function fully recovers by M500 = 
2 X 10'* h^^ Mq, approximately the median mass in the dis- 
tant sample. The differential representation of the mass func- 
tion (Fig.[l7} also shows that this deficit is consistent with the 
Poisson noise expected in the data. 

Our high-z sample can be split into several redshift bins to 
check if the observed evolution within the sample is still con- 
sistent with the model. Figure[T8|shows the results for the three 
bins, z = 0.35 - 0.45, 0.45 - O.^Tand 0.55 - 0.9, approximately 
14 clusters in each. The data are still in good agreement with 
the model predictions. The strongest deviation is a marginal 
(~ Iff) deficit of clusters at z = 0.35 - 0.45. 

6.1. Sample Variance 

In addition to the Poisson cluster counting uncertainties, 
there is sample variance in the number of clusters in a survey 
of Umited volume due to large-scale clustering. Depending on 
the mass scale, the sample vari ance can be comparab le to, or 
larger than, the Poiss on errors ( Hu & Kravtsov|2003) l. We fol- 
low the formalism of Hu & Kravtsov'to assess the iniportance 
of sample errors in the error budget in our case. 
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Fig. 17. — Differential representation of the mass functions shown i 
The error bars in representation are uncorrelated (unlike Fig|16[, 
significance of the observed deviations from the best-fit model can be easily 
assessed. 

We calculate the sample variance for the two geometries. 
For the local sample we assume aU-sky coverage with an ex- 
clusion zone of ±20° from the Galactic p lane; the var iance for 
this geometry is given by equation A7 of Hu & Kravtsov ( 2003). 
The second is a pencil -beam volume with a small circular foot- 
print on the sky, which is appropriate for the individual ROSAT 
fields included in the 400d survey; the variance for this geom- 
et ry can be computed using the flat-sky approximation (eq. 7 
in |Hu & Kravtso v'2003). The variance calculations are done 
for our reference cosmology with the power spectrum normal- 
ization ffg = 0.8, resulting in a slightly higher variance than 
what would be predicted for our best-fit cosmological model 
with slightly lower ff g. The halo mass function model is from 
'Jenkins et al.' (2001' their eq. B3) and the cluster bias model 
is from Sheth & Tormen ( 19991. These mass function mod- 
els use cluster masses defined within the aperture enclosing an 
overdensity of 180 with respect to the mean density, and so we 
need to relate it to the mass definition adopted here (A = 500 
with respect to critical density). We assumed a simple rela- 
tion, M500 ~ O.55M180, appropriate for typical concentrations 
of clusters in our mass range. 

The relative imp ortance of the sample variance increases for 
low-mass clusters l|Hu & Kravtsov 2003 1. Fortunately, it is still 



sufficiently small near our mass limit. For example, at the Um- 
iting M500 = 1.5 X lo''' M©, the local survey volume cor- 
responds to the effective redshift depth Zmax = 0.043; the sam- 
ple variance calculated for this mass Umit and redshift range 
z = 0.025 - 0.043 is a^^, = {{n^)l{nY - ifl^ « 0.16. This 
should be compared with the Poisson errors at this mass limit, 
ffshot = 0-24 (Fig. 161. Combining these variances in quadra- 



ture, we find that'tEe total uncertainty fftot = (ffyar + ''Thot)'^^ 
is only 17% larger than the Poisson value. The contribution of 
sample variance quickly becomes small for higher masses. For 
example, at M500 = 3 x lo'** h^^ M© (the median mass for the 
low-z sample), the total uncertainty is only 7% larger than the 
Poisson error; the contribution becomes negligible at higher 
masses. Going to lower masses, we predict that ay„ becomes 



comparable to ffshot for M500 
mass limit. 



7 X 10^-' h'^ M©, below our 
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Fig. 18 
bins. 



Same as Fig.|16|but the high-z sample is split into three redshift 



The high-z sample consists of 1600 widely separated (and 
therefore independent) pencil-beam pointings. For a single 
pointing of circular radius of 17.5' and redshift range z = 
0.35 - 0.45 (the variances for the higher redshift bins are sim- 
ilar but somewhat smaller), the sample variance is ffvar.i ~ 
0.65 - 1.65 for the samples with M500 thresholds between 10^* 
and 10^^ M©. Assuming that the individual pointings are un- 
corrected (a good assumption for the widely separated point- 
ings of the 400d survey), the total sample variance is ffvar ~ 
o-var.i « 0.02 - 0.05, where N = 1600 is the number of 

400d survey pointings, much smaller than the Poisson uncer- 
tainties. The sample variance can therefore be safely neglected 
for our high-z sample. 

In principle, sample variance can be included in the calcula- 
tion of the likelihood functions for the low-z sample ( Holder 
|2006; Hu & Cohn 2006). The procedure, however, would be 
quite cumbersome in our case and is not worth the effort be- 
cause the variance increases the measurement errors by only 
17% in the worst case, and by 7% or less for the median sam- 
ple mass. This is considerably smaller than expected system- 
atic effects and we will therefore neglect the sample variance 
hereafter. 

7. LIKELIHOOD FUNCTION 

Let us now consider the expression for the likelihood func- 
tion appropriate for our sample and for our method of deriving 
the mass functions. The basics of the likelihood ftinction are 
very standard and used in large nu mber of o ther works. We 
follow the derivation presented in Cash 1 1979( 1 for the case of 
purely Poisson statistics. We split the mass mtervals into nar- 
row bins, AM, so that the probability to observe a cluster with 
an estimated mass in this bin is small, |i(M'^*',z) AM « l,and 
we have at most one cluster per bin. The Uk elihood function 
in this case can be written as (c.f |Cash|1979| l 



lnL = Xhi(p(Mf,z,)AM,) 



JJm,z 



(24) 



where summation is over the clusters in the sample and inte- 
gration is over pre-selected Zmin - Zmax and M^nm - Afmax inter- 
vals. Usually, the AM terms can be dropped because they are 
independent of the model parameters and thus simply add a 
constant to the likeUhood function. In our case, however, the 
estimated masses are also a function of the background cos- 
mology. When M"' is changed because of the variation in the 
cosmological parameters, we should correspondingly stretch 
the mass interval, AM = AM'^°^ M / M'^°\ where M^") and 
are the estimated mass and width of the interval for 
some fixed reference cosmological model.^" Taking the loga- 
rithm of this expression and dropping constant terms (M^"-' 
and AM^"-*), we obtain the likelihood function in the form 



InL = ^lnp(Mf ,z,) + Y,^nMf 



ff p{W\z) dM^^'dz. 

JJ M,z 



(25) 



The calculation of individual terms in this expression is dis- 
cussed in AppendbclB] The likelihood function implicitly de- 
pends on the cosmological parameters through the model 
of cluster mass ftinction (reflecting the growth, normaliza- 
tion, and shape of the density perturbation power spectrum), 
through the cosmological volume-redshift relation which de- 
termines the survey volume, and through the distance-redshift 
and _E(z) relations which affect our cluster mass estimates. The 
best fit parameters are obtained by maximizing the likelihood 
function in eg. (25]. We also can use standard methods (e.g. 
Cash 1979| to estimate uncertainties of the model parameters. 
Ihe aidvantage of this approach is that we do not use any bin- 
ning in either mass or redshift. 

In addition to the best fit parameters and confidence inter- 
vals, it is also useful to be able to characterize the goodness 
of fit. Even though the likelihood function cannot be used for 
this purpose directly, we can utilize it to obtain an effective 
for every cosmological model. First, we note that essentially 
all the cosmological information provided by the cluster mass 
ftinction is the normalization and slope of the Unear perturba- 
tions power spectrum at ~ 10 Mpc scales. Statistical quaUty of 
our sample is sufficient to fit Cg independently in 4 redshift bins 
(the local sample and the high-z subsamples z = 0.35 - 0.45, 
0.45 - 0.55, and 0.55 - 0.9) and tilt to the entire sample. In- 
dividual best-fit values of as should be consistent within the 
errors if the background cosmology is "correct"; similarly, tilt 
(relative to the best-fit slope constrained by CMB data) should 
be consisten t with zero if we trust the CDM transfer function 
models (see |Eisenstein & Hu|1998| and references therein). To 
characterize how close the tilt is to and individual erg's to a 
constant value, we can take advantage of the fact that the de- 
viation of the quantity C = -2 In L from the minimum has 
statistical properties of the distribution (Cash 1979). The 
effective "tilt" component of the total x^ can be computed as 



x]{0) = min C(0, ffg, 0) - min C{t, (Tg, 0), 



(26) 



where 6 is the vector of cosmological parameters other than 
tilt and normalization of the power spectrum. Similarly, the 
effective x^ component for the evolution in the normalization 
of the mass function is 

xloiiO) = C;(0, ds, 9) - min C^O, ag, 6), (27) 

J (Tg 

This is equivalent to the rule of transformation of the probability density 
function under change of variables, p{y) dy = p{x) dx. 



22 



VIKHLININ ET AL. 



where summation is over several sufficiently wide redshift bins 
and ds is the best-fit value to the entire sample. Adding these 
terms, we obtain the total effective for the cluster mass func- 
tion data, 



(28) 



This effective can be used to check how consistent are the 
model and observed mass functions in terms of general shape 
and evolution in the normalization. 

A detailed discussion of fitting cosmological parameters to 
our cluster data will be presented in Paper III. Here, we quote 
only the results of fitting the power spectrum normalization 
for our reference cosmology, = 0.746 ± 0.009 (purely sta- 
tistical uncertainties). The best fit models are shown by sohd 
Unes in Fig. 16 - 18 



8. SYSTEMATIC ERROR BUDGET 

We conclude the analysis with a summary of the systematic 
error budget in the mass function measurements. We begin 
with a discussion of several sources of observational uncer- 
tainties (those affecting measurements of the basic cluster pa- 
rameters), and then summarize the modeling uncertainties — 
those related to the mass vs. proxy relations and determination 
of the survey volumes. 

8.1. Observational Uncertainties 
8.1.1. Calibration Uncertainties 

The accuracy of the basic X-ray observables — average T 
and soft X-ray flux — is Umited by absolute calibration of the 
Chandra effective area. The calibration parameters most rele- 
vant for our study is the absolute value of the soft-band effec- 
tive area and the relative hard- to soft -band calibration. Thanks 
to the great effort put into calibration of the Chandra tele- 
scopes, both on the ground and in flight, the associated un- 
certainties are small, but they still need to be discussed for the 
sake of completeness. 

The absolute soft-band (~ 0.5-2 keV) effective area affects 
the measured cluster luminosities and gas masses, Lx oc A^^, 
Mgas oc A^'^l'^. The largest source of uncertainty in Asoft is in- 
flight contamination of the ACIS optical blocking filters by a 
hydro-carbon compound. Fortunately, in our energy band of 
interest, this contamination can be accurately measured as a 
function of time and position using the on-board calibration 
source, and so the soft-band effective area can be brought to 
its absolute pre-flight cahbration, which is accurate to r* 3% 
pdgar & Vildilinin 2004 1. The effect of such uncertainties on 
the derived mass function (through the mass proxies and the 
Lx - M relation) is negligible. The validity of the soft-band 
calibration is indirectly confirmed by the excellent agreement 
in the Chandra and ROSAT flux measurements. 

The relative hard-to-soft area calibration affects tempera- 
tures and hence hydrostatic total mass measurements. We will 
characterize this effect approximately by the relative change 
of measured temperatures ST^-^y = AT/T for the 5 keV 
clusters. The hydrostatic Mtot measurements are affected as 
A M500/M5 00 = 3/2 STcai (see, e.g.. Appendix A in Vikhlinin 



et al.|2006| . This uncertainty is transferred to our mass func 



tion determinations because all mass vs. proxy relations are 
calibrated using hydrostatic M,ot measurements. 

Calibration uncertainties for the cluster temperatures can- 
not be characterized exactly. Approximate estimates can be 
made from comparison of the values derived by different tele- 
scopes caUbrated independently or by looking at the effect of 



the most relevant "fudge" factors for Chandra. The systematic 
difference between XMM-Newton and Chandra temperatures 
is approximately 7% (V05I. The largest remaining Chandra 
calibration uncertainty is, as of this writing, related to the ef- 
fect of the 10 - 20 A hydro-carbon overlayer on the X-ray mir- 
rors. Experimenting with variations of the overlayer model, 
we find that the range of possible temperature variations is 
-6% < ^Tcai < 0, and STa^i is nearly independent of the cluster 
temperature and redshift. This would be equivalent to up to 
-9%, z-independent shift in the mass scale. 

8.1.2. Astronomical Uncertainties 

In addition to calibration uncertainties, we checked a num- 
ber of "astronomical" effects which also could affect the mea- 
surement of basic cluster properties. The effects that we 
checked and determined to be negligibly small include uncer- 
tainties in the Galactic interstellar absorpti on measurements 
(based on neutral hydrogen 21 cm maps IDickey & Lock- 



man 19901; absorption by ionized (warm) ISM in the Galaxy 
(Reynoldsj|1993jl; difference between plasma spectral codes; 
possible variations of the He abundance around the cosmic av 



erage (our conclusion is based on the analysis of ]Peng & Nagai 
,2008) . 

The only effect which is marginally significant is the possi- 
ble evolution of the ICM metallicity. Since the statistics in the 
data for our high-z sample are insufficient for ICM metallicity 
measurements, we assumed in each object that the metalUc- 
ity is equal to 0.3 Solar, approximately the mean value for the 
low-z population. If in fact there is an evolution in the heavy 
element metal abundance, our derived values for Tx and Mg^s 
are affected slightly. For example, if the mean abundance for 
high-z clusters is 0.15 Solar, the derived Tx and Mgas will be 
higher by +4%, and -i-l%, respectively. Such a trend (w hich is 



probably outside the r ange allowed by the data, see Tozzi et al. 



2003| Maughan et al. 2008 ), will be equivalent to changing the 



mass scale tor our high-z sample by +3%, +1%, and +6% if 
masses are estimated through the Mtot - Yx, Mtot - Afg^s, and 
Mtot - Tx relations, respectively. 

8.2. Modeling Uncertainties 

Modeling uncertainties in the mass function measurements 
can be separated into two components, 1) how accurately we 
can predict the survey volume for clusters of a given mass, and 
2) how accurately we can derive cluster masses from the data. 
The first component mainly depends on the accuracy of the 
Lx - Mtot relation, and the second, on the Mtot vs. proxy re- 
lation. All these uncertainties were discussed in detail above 
and so we provide only a summary here. 

8.2.1. Uncertainties in V (M) 

Uncertainties in the survey volume mainly depend on how 
accurately we can recover the Lx - Mtot relation from the data, 
assuming that masses are accurately reconstructed from the 
Yx, Mgas, or Tx proxies. The effects of these uncertainties on 
V{M) are considered in §[5] The largest error is related to the 
measurement of the evolutionary factor (eq.|2TJ and amounts 
to ±22% in volume for the median mass in our high-z sam- 
ple, ~ 2.1 X 10^* h^^ M© (Fig. 15 1. This source of error is sta- 
tistical in nature (related to measurement uncertainties in the 
Lx - Mtot parameters). It can therefore be added in quadra- 
ture to the purely Poisson errors (±26% in the cumulative mass 
function for the same mass threshold), resulting in a moderate 
increase in the statistical errorbars. Although it is possible to 
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TABLE 4 

Summary of main systematic uncertainties in the cluster mass function 



Source of error 



Affects 



Uncertainty 



ref 



§|5]|Paper I| 



§5 



§?1.1 



ITT 



400d selection function V(M) for high-z sample ±3% 

Scatter in Lx - Mtot V{M) ±10% 

Evolution in Ix - Mtot V(M) for high-z sample ±22% for median mass 

Chandra calibration Mo -9% - 

ICM metallicity vs. z My(z), Ma{z), Mr{z) +3%, +1%, +6%, respectively 

Accuracy of X-ray hydrostatic mass estimates Mo 0-9% 

Departures from self-similar evolution Mx{z) ±7% 

Departures from self-similar evolution My{z) ±5% 

fgjz) Me(z) ±5% 

Note. — The volume uncertainties are quoted for the median mass in the sample. The Mtot uncertainties are separated into calibration of the 
low-zmass vs. proxy relations (Mo) and uncertainties in the evolutions of these relation, M(z). Mf (z), Mq(z), and My(z) stand for total masses 
estimated using Tx, Mgas, and Yx, respectively. The evolutionary uncertainties are quoted at z = 0.5. 



include these uncertainties approximately in the cluster likeli- 
hood function or effective x^y a more accurate estimate of their 
effect on the final results can be obtained by repeating the en- 
tire analysis procedure with the evolution factor varied within 
its measurement errors. We take this approach and we will 
quote the associated parameter uncertainties in Paper IIL 

Other sources of uncertainty from the Lx - Mtot modeUng, 
such as the exact scatter in the relation, functional form of the 
evolution term, etc., are comparably small. The accuracy of 
statistical calibration of the 400d survey selection function also 
makes a negligible contribution, ±3%, to the volume error (^Pa-j 
per 



8.2.2. Uncertainties in derived Mtot 

Separate sources of uncertainties are related to potential bi- 
ases of the Mtot estimates from X-ray proxies. Note that these 
biases will have little effect on the volume computations for 
the given cluster (if we change the estimated Mtot's, we need to 
refit the Lx - Mtot relation and the net effect will be that the 
volume for the given cluster is almost unchanged). In a sense, 
the V(M) systematics move the cumulative mass functions in 
Fig.fielup and down, while the potential Mmt biases shift the 
massTunction along the M axis. 

The mass biases can be naturally separated into two compo- 
nents. The first is related to calibration of the Mtot vs. proxy re- 
lations for low-z clusters. Assuming that the evolution in the 
relation is nominal, such biases will shift the low and high-z 
mass function by the same amount, or, equivalently, will af- 
fect the overall normaUzation, but not the evoluti on in t he co- 
moving number density. As we discussed above (§ 4.3. com- 
parison of X-ray and weak lensing masses provides a good es- 
timate, ±9% in mass, for such biases. 

The second source is departures of the evolution in the Mtot 
vs. proxy relation from the assumed forms. Since evolution 
is negligible within the low-z sample, such biases are impor- 
tant only for the high-z mass function and thus will affect the 
derived evolution in the cluster number density, but not the 
overall normaUzation of the mass functions. We estimate that 
by z = . 5, the evolutionary Mtot biases can be up to ± 7% in the 
Mtot - Tx rel ation, and ±5% f or the Mtot - Mgas and Mtot - Yx 
relations (§§|^ 



4.L3 



4.2.2 



4.3 



respectively). In the case of the 

Mtot - Tx we also need to add a ±6% uncerta inty re lated to the 



potential evolution in the ICM metalUcity (§ 8.1.2 1 



The estimated uncertainties in the Mtot calibration cannot 
be easily included in the likelihood function. Instead, we check 
(Paper III) how they affect the cosmological fit by repeating the 
entire analysis procedure with the parameters Mtot vs. proxy 
relations within the bounds specified above. Note also that the 



use of three different mass proxies, each with its own bias, pro- 
vides a good consistency check, because results obtained with 
different proxies can be compared to each other to check for 
biases. 

9. SUMMARY 

We presented a report on data analysis procedures leading 
to a measurement of the galaxy cluster mass functions using 
Chandra observations of statistically complete samples of low 
and high-z clusters originally selected in the X-ray data from 
ROSAT. This measurement relies on a careful selection of the 
parent samples, rather detailed Chandra observations of se- 
lected objects, and using several robust X-ray proxies for the 
total cluster mass (Yx. Mg^s, Tx). 

The scaling relations between proxies we use and Mtot 
mostly follow the predictions of the self-similar theory, a very 
basic and hence reUable model. We used advanced high- 
resolution numerical simulations to test the predictions of this 
theory with regard to our proxies; these simulations indicate 
that only small corrections are necessary, which we use cau- 
tiously. At low redshifts, the Mtot vs. proxy relations were cal- 
ibrated by detailed Chandra observations of a sample of re- 
laxed clusters spanning a wide range of mass; our Chandra 
results were cross-checked against recent weak lensing mea- 
surements. 

As a part of this project, we derive a relation between cluster 
mass and total X-ray luminosity, using large statistically com- 
plete samples and properly taking into account the Malmquist 
bias. The relation is adequately described by a single power law, 
substantial log-normal scatter, and evolution of the power law 
normaUzation following £(z)''^^ — assuming that the evolu- 
tion in the Mtot ~ Yjf relation is exactly seLf-simUar as we use 
Yx to estimate cluster masses. 

We present the cluster mass functions estimated assuming 
a "concordant" ACDM cosmology. These data shows a sig- 
nificant evolution in the cluster comoving number density at 
a fixed mass threshold, by a factor of « 5 at M500 = 2.5 x 
10^" h-^ M© between z = and 0.5. 

Finally, we provide a summary of estimated systematic un- 
certainties in our mass function measurement. Most source 
of systematics lead to corrections which are smaUer than the 
Poisson errors in our data. The main exception is uncertain- 
ties in calibration of the absolute mass scale at low redshifts 
but it has Uttle impact on the measurment of evolution in the 
cluster number density. 

The evolution in the cluster mass function reflects the 
growth of density perturbations and can be used for the 
cosmological constraints complementing those from the 
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distance-redshift relation. The cosmological modeling of these 
data will be discussed in a future paper. 
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APPENDIX 



A. CORRECTION OF THE LUMINOSITY-MASS RELATION FOR THE MALMQUIST BIAS 
Stanek et al. ( 2006 jl recently pointed out that observational determinations of the mass-luminosity relation can be significantly 
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affected by Malmquist bias because of the flux-Umited nature of most of the available cluster samples. IStanek et al. were interested 



in the relation where Lx is the independent variable; such a relation is useful for Mtot estimates using Lx as a proxy. Corrections 
for Malmquist bias in this case lead to complicated computations involving the cluster mass function model. We, instead, are 
interested in computing the survey volume for objects of given mass, for which we need to treat Mtot as independent variable and 
compute luminosity for a given mass (eq.jTsj). The calculations of the Malmquist bias are then much simpler and can be done 
independently of the mass function modelmg. 
We assume that the scatter in L for fixed M has a log-normal distribution, 

^ dN I (lnI-lnIo)^\ , . 

where Lq is the average luminosity for the given mass. Typically, Lq is a power law of mass, Lq oc M", but we do not make this 
assumption in the calculations below. 

A. 1 . Corrections for Malmquist Bias in Non-Evolving Lx - M Relation 

Calculations of the Malmquist bias are particularly simple if the evolution in the Lx-M can be neglected, e.g., Lq (M) is the same 
at all redshifts within the sample. This situation is appUcable for the analysis of the low-z samples and in our case, for establishing 
the low-z reference relations. 

Let us assume that the survey volume as a function of the object luminosity can be approximated as a power law 

V{L) oc l\ (A2) 

For example, in the case of Euclidean space and a pure flux- limited survey, V{L) oc L^l^ exactly. If there is a lower redshift cutoff 
in the survey, 5 # 3/2 at the low-L end, even in Euclidean space. Likewise, if there is a higher-redshift cutoff, 5 ^ in the high-L 
end (sample becomes volume-limited). 
The Lx-M relation is usually fit in the In M - In L coordinates, so we need to compute the bias in In L for given M: 

/_'^(lnl - InLo) p(lnl) V{\nL)d\nL fZ^ exp{-x'/2cy') exp{xS)dx 

' " fZp{^nL)V{\nL)dlnL fZexpi-x^la^) exp{xS) dx ' ^ ' 

where we used the substitution x = InL - InLo. The integrals can be worked out analytically, 

(InL) -InLo = — '—^ = d(J^. (A4) 

\/2n a exp{S^ 

The log-normal scatter in the relation, cr, is usually unknown apriori and thus it should be estimated from the rms scatter around 
the best-fit relation in the In M - In L plane. Fortunately, the flux-Umited survey does not introduce bias in the scatter, i.e. ffobs = 
as we now demonstrate. 

ffi, = ((InL - InLo)') - (Inio - (Inl))' = ((InL - InLo)') - cj' (A5) 

exp(-x^/2ff^) exp(x5) , 

((InL-lnlo) ) = ^^=^^5 P — 7^-^ — P-T^ = a\l + S^a^) (A6) 

J_^exp{-x^/2a^) exp{xS)dx 



(c.f eq. 



A3 



and 



A4l and so a^^,^ 



A.2. Correction for Individual Clusters 

The bias computations from the previous section cannot be appUed if we aim to model also the evolution in the Lx-M relation, 
because in this case we need to compute the bias in a fixed narrow interval of z where V{L) cannot in general be represented 
with a power law (e.g., for an ideal flux-hmited survey, V{L) in a narrow interval of z is close to a step-function). An alternative 
(approximate) approach is to compute the expected biases in L for individual clusters, as considered below. A better approach is 
to model all effects of selection through the Ukelihood function, as discussed in the next section. 

Let us assume that the survey has a single flux threshold, /min (i.e., the cluster is always detected if / > /min and not detected if 
/ < /min)- The average luminosity bias of detected clusters with a given mass is 

, , f°° X exp{-x^ /2a^) dx exp (-x^:j2a^) 

(InL - InLo) = (In/ - In/o) = '-^ ^ ' = , t it ^^^^ 

A„i„ exp(-x2/2o-2) dx ^faj2 erfc (xtnin/ {o^)) 

where /o is the flux corresponding to the nominal luminosity, Lq, given by the Lx-M relation, and x^^n = In /min - I n fp . For 



A7 



/o » /min (very massive clusters), the bias is as expected. For very low mass clusters (Lq ^ 0, /q ^ 0, x^in oo), eq. 
(In/ - In fo ) ~ In fmin - In/o (i.e. all detected clusters have fluxes just above the survey threshold). 

Equation ]A7| is easily generalized for the case when the survey selection probabihty is a smooth function of flux (as is the case 
for the 400d sample): 

r°° xPseiC^ + In fo) exp(-A:^/2ff^) dx 
(InL - InLo) = (In/ - In/o) = 7 /"^ ^ ; ^> (A8) 

i_oo -Psei(* + In/o) exp(-x2/2(j2) dx 
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Fig. 19. — Distribution of the Lx - M parameters recovered from mock catalogs (see text for details). The points show the deviations of the best-fit parameters 
in each realization from the input values, and the histograms show the probability density distribution for each parameter. The right panel shows the deviations of 
best-fit luminosities at z = and 0.55 for the median mass in the samples (the z = results are equivalent to those for the overall normalization, Ao) for fits with 
a fixed at the nominal value (see text). 



A. 3. Likelihood Function and Fitting Procedure 

The best way to treat the Malmquist bias in modeling the relation is through a proper definition of the likelihood function. Let 
Psei (In/) be the survey selection efficiency as a function of flux. The average luminosity-mass relation gives a "nominal" luminosity 
for clusters of given M, which corresponds to a "nominal" flux /q. The probability density function for the cluster to have flux / is 



dP 



C exp 



|_an 



/-In/o)^ 



dlnf \ 20-2 

where C is the normalization coefficient defined so that the total probability is 1 



C" 



exp 
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For a survey with a single sharp flux limit [i.e. those with Psei(/) = S(/~/min)]. 
The total likelihood function, C, is the product of dP/ d I n / for ind ividual d 



the usual x for finding the best fit and confidence regions (Cash 1979|. From I A9 1, we have 

(ln/,-ln/o.,)' 



A10|becomes C = (| erfc[ln(/min//o)/(ffx/2)])-i. 
liste rs. The quantity -2 In C can be used in place of 



2ln£ = ^ 



■2lnC, -2lnPsei(ln/i)- 



(All) 



The first term on the right-hand side of Al 1 is the usual unweighted and the extra two terms are corrections for the Malmquist 
bias. Masses of individual clusters and parameters of the Lx - M relation enter the likelihood function impUcitly, through calcu- 
lating the "nominal" luminosities [e.g. InLo = A\n M + B + evol(z)] which are then converted to fo's. 

Parameters of the Lx - M relation can be obtained from finding the global maximum of the likeUhood function I All| . In 
practice, we use a multi-step procedure to fit the Lx - M parameters. The scatter, overall normalization, and power law slope are 
determined from the low-z data where the measurement uncertainties can be neglected relative to intrinsic scatter. The best-fit 
scatter is corrected by a factor of {N/{N - l)y/^, the expected bias of the Maximum Likelihood estimate, where N is the number 
of clusters in the low redshift sample. Then, with Aq, a, and a fixed, the evolutionary term (y) is determined from the fit to the 
high-z data. The procedure is iterated several times until convergence. 

To assess how well our fitting procedure recovers the parameters of the Lx - M relation, we applied it to mock cluster samples. 
The mock samples were designed to mimic closely our actual low- and high-z samples. The cluster masses and redshifts were 
drawn from the mass function model computed in the D.m = 0.28, A = 0.72, ffg = 0.79 cosmology. The luminosities were then 
simulated assuming a mass-luminosity relation with parameters {\nAo,a,y,a) = (47.4, 1.6, 1.8,0.4) (c.f our best-fits parameters 
in eq.|22|), the observed fluxes computed for this background cosmology, and finally, the selections appropriate for the ROSAT 
All-Sky and 400d surveys were appUed. The simulated lists and the real sample have approximately the same number of clusters. 

The distribution of the deviations of best-fit parameters from their nominal input values is shown in Fig. 1 9 We are able to recover 
all parameters, normaUzation, scatter, evolution term y, and the slope, a, (not shown in the figure), without significant biases. The 
widths of the distributions, AlnAo = 0.085, Aa = 0.14, Ay = 0.42, and Acr = 0.039, correspond to the expected measurement 
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uncertainties for each parameter. Note that the uncertainties for individual parameters are correlated. For example, the low- 
z normalization is obviously anti-correlated with the evolution parameter, y. The scatter is anti-correlated with both the low-z 
normalization and evolution because the Malmquis bias corrections are oc a^. These correlations have to be kept in mind when 
we estimate the uncertainties in the survey volume computations associated with the measurement errors of the Lx - M relation. 
In particular, the most important parameters for V{M) are the average Lx's for the median mass of our low and high-z samples. 
For nearby clusters, this corresponds simply to the uncertainties in Aq, but for high-z clusters, this is a complex combination of 
uncertainties in Aq, a, and y. The results for the average normalizations are shown in the right panel of Fig. T9l We are able to 



recover the true average luminosities without a significant bias and with uncertainties of r* 8.0% and 10.5% aflow and high-z, 
respectively. 

B. LIKELIHOOD FUNCTION CALCULATIONS 
B.l. Calculation of p{M"',z) 

Generally, the probability density distribution of the observed masses is given by convolution of the model distribution of the 
true masses and the scatter between M'^'" and M"'"'^. The former is simply the product of the theoretical mass function dn/iiM*™' 
and survey volume at this redshift, dV{M^^"^ ,z)/dz (the calculation of dV{M)/dz is discussed in §[5}, and so we have 

dn c^y(M•™^z)' 
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(Bl) 



A log-normal distribution is a good approximation for the scatter in the mass estimates, and so the convolution in eq.( |Bl| can be 
written as 
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The function p(M'^'",z) enters the expression for likelihood in summation over ob serv ed clusters (first term in eq.(25|) and in 
the integral over the observed range (second term in the same equation). Equation B2 should be evaluated numerically, but the 
calculation of all the terms is straightforward. The term dn/d\n M"^"*^ is the differential duster mass function at the given redshift. 
Cosmological parameters enter th e cal culation of (iy(M"^"'^,z) through the volume-redshift relation and the evolving cluster 
Lx - M relation which is derived ( ^5.1| using Lx and Mtot estimated in this cosmology. 

B.2. Integration of p{M"*,z) 
We now need to evaluate the second term in eq.|[25], 

M„„ Z,„ax 
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Using 



B2 



we have 
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Changing the order of integration, we have 
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The last term in this equation is the integral of the normal distribution (can be computed numerically using the library error 
function): 
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so finally. 
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The quantities cr'^'*' are the total uncertainties of the mass estimates, including intrinsic scatter and measurement errors, 



^est _ ( 1 2 \ 

"i ~ V^intr "meas.i^ 



1/2 



In practice, a"' are not the same because at least cr„eas,i varies from cluster to cluster. A reasonable strategy to include these 
variations is to replace . . , . . .) with an average over all sample members. 
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We use this equation to evaluate the second term in the expression for likelihood function (eq.(25] 



